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The  SETAC  Special  Publications  Series  was  established  by  the  Society  of  Environmental 
Toxicology  and  Chemistry  (SETAC)  to  provide  in-depth  reviews  and  critical  appraisals  on 
scientific  subjects  relevant  to  understanding  the  impacts  of  chemicals  and  technology  on 
the  environment.  The  series  consists  of  single-  and  multiple-authored  or  edited  books  on 
topics  reviewed  and  recommended  by  the  SETAC  Board  of  Directors  and  approved  by  the 
Publications  Advisory  Council  for  their  importance,  timeliness,  and  contribution  to  mul¬ 
tidisciplinary  approaches  to  solving  environmental  problems.  The  diversity  and  breadth 
of  subjects  covered  in  the  series  reflects  the  wide  range  of  disciplines  encompassed  by 
environmental  toxicology,  environmental  chemistry,  and  hazard  and  risk  assessment. 
Despite  this  diversity,  the  goals  of  these  volumes  are  similar;  they  are  to  present  the 
reader  with  authoritative  coverage  of  the  literature,  as  well  as  paradigms,  methodologies 
and  controversies,  research  needs,  and  new  developments  specific  to  the  featured  topics. 
All  books  in  the  series  are  peer  reviewed  for  SETAC  by  acknowledged  experts. 

The  SETAC  Special  Publications  are  useful  to  environmental  scientists  in  research,  re¬ 
search  management,  chemical  manufacturing,  regulation,  and  education,  as  well  as  to 
students  considering  careers  in  these  areas.  The  series  provides  information  for  keeping 
abreast  of  recent  developments  in  familiar  subject  areas  and  for  rapid  introduction  to 
principles  and  approaches  in  new  subject  areas. 

Ecological  Risk  Assessment  of  Contaminated  Sediments  presents  the  collected  papers  stem¬ 
ming  from  a  SETAC-sponsored  Pellston  Workshop  on  Sediment  Ecological  Risk 
Assessment,  held  in  Pacific  Grove,  California,  23-28  April  1995.  The  workshop  focused 
on  discussions  of  unresolved  scientific  issues  and  needed  research  in  the  area  of  sediment 
ecological  risk  assessment.  Like  all  previous  SETAC  workshops,  participation  was  limited 
to  invited  experts  from  government,  academia,  and  industry  who  were  selected  because 
of  their  experience  with  the  workshop  topic.  The  workshop  provided  a  structured  envi¬ 
ronment  for  the  exchange  of  ideas  and  debate  such  that  consensus  positions  would  be 
derived  and  documented  for  some  of  the  issues  surrounding  the  science  of  sediment  eco¬ 
logical  risk  assessment. 
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Preface 


This  book  presents  the  proceedings  of  the  22nd  Pellston  Workshop,  held  23-28  April 
1995  in  Pacific  Grove,  California,  where  the  workshop  series  began  in  1977.  Like  previous 
workshops,  participation  was  limited  to  invited  experts  from  government,  academia,  and 
industry  who  were  selected  because  of  their  experience  with  the  workshop  topic.  The 
workshop  provided  a  structured  environment  for  the  exchange  of  ideas  and  debate  such 
that  consensus  positions  would  be  derived  and  documented  for  some  of  the  issues  sur¬ 
rounding  the  science  of  sediment  ecological  risk  assessment.  The  proceedings  reflect  the 
current  state-of-the-art  of  these  topics  and  focus  on  principles  and  practices  designed  to 
improve  the  scientific  and  regulatory  communities’  ability  to  assess  environmental  risks 
associated  with  contaminated  sediments. 
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Foreword 


This  workshop  was  a  continuation  of  a  series  of  successful  workshops  called  the  “Pellston 
Workshop  Series.”  Since  1977,  twenty-six  workshops  have  been  held  at  Pellston  and  sev¬ 
eral  other  locations  to  evaluate  current  and  prospective  environmental  issues.  Each  has 
focused  on  a  relevant  environmental  topic,  and  the  proceedings  of  each  have  been  pub¬ 
lished  as  a  peer-reviewed  or  informal  report.  These  documents  have  been  widely  dis¬ 
tributed  and  are  valued  by  environmental  scientists,  engineers,  regulators,  and  managers 
because  of  their  technical  basis  and  their  comprehensive,  state-of-the-science  reviews. 
The  workshops  in  the  Pellston  Series  are  as  follows: 

•  Estimating  the  Hazard  of  Chemical  Substances  to  Aquatic  Life.  Held  in  Pellston, 
Michigan,  June  13-17, 1977.  Proceedings  published  by  the  American  Society  for 
Testing  and  Materials,  STP  657,  in  1978. 

•  Analyzing  the  Hazard  Evaluation  Process.  Held  in  Waterville  Valley,  New 
Hampshire,  August  14-18, 1978.  Proceedings  published  by  The  American 
Fisheries  Society  in  1979. 

•  Biotransformation  and  Fate  of  Chemicals  in  the  Aquatic  Environment.  Held  in 
Pellston,  Michigan,  August  14-18, 1979.  Proceedings  published  by  The  Ameri¬ 
can  Society  of  Microbiology  in  1980. 

•  Modeling  the  Fate  of  Chemicals  in  the  Aquatic  Environment.  Held  in  Pellston, 
Michigan,  August  16-21, 1981.  Proceedings  published  by  Ann  Arbor  Science 
in  1982. 

•  Environmental  Hazard  Assessment  of  Effluents.  Held  in  Cody,  Wyoming, 

August  23—27, 1982.  Proceedings  published  in  a  SETAC  Special  Publication  by 
Pergamon  Press  in  1986. 

•  Fate  and  Effects  of  Sediment-Bound  Chemicals  in  Aquatic  Systems.  Held  in 
Florissant,  Colorado,  August  11-18, 1984.  Proceedings  published  in  a  SETAC 
Special  Publication  by  Pergamon  Press  in  1987. 

•  Research  Priorities  in  Environmental  Risk  Assessment.  Held  in  Breckenridge, 
Colorado,  August  16-21, 1987.  Proceedings  published  by  SETAC  in  1987. 

•  Biomarkers:  Biochemical  Physiological  and  Histological  Markers  of  Anthropo¬ 
genic  Stress.  Held  in  Keystone,  Colorado,  July  23-28, 1989.  Proceedings 
published  in  a  SETAC  Special  Publication  by  Lewis  Publishers  in  1992. 

•  Population  Ecology  and  Wildlife  Toxicology  of  Agricultural  Pesticide  Use:  A 
Modeling  Initiative  for  Avian  Species.  Held  in  Kiawah  Island,  South  Carolina, 
July  22—21, 1990.  Proceedings  published  in  a  SETAC  Special  Publication  by 
Lewis  Publishers  in  1993. 

•  A  Technical  Framework  for  [Product]  Life-Cycle  Assessments.  Held  in  Smuggler  s 
Notch,  Vermont,  August  18-23, 1990.  Proceedings  published  by  SETAC  in 
January  1991,  with  second  printing  in  September  1991  and  third  printing  in 
March  1994. 
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•  Aquatic  Microcosms  for  Ecological  Assessment  of  Pesticides,  Held  in  Wintergreen, 
Virginia,  October  7-11, 1991.  Interim  Report  published  February,  1992. 
Proceedings  to  be  published  by  SETAC  in  1997. 

•  A  Conceptual  Frameworkfor  Life-  Cycle  Assessment  Impact  Assessment.  Held  in 
Sandestin,  Florida,  February  1-6, 1992.  Proceedings  published  by  SETAC 
in  1993. 

•  A  Mechanistic  Understanding  of  Bioavailability:  Physical-Chemical  Interac¬ 
tions.  Held  in  Pellston,  Michigan,  August  17-22, 1992.  Proceedings  published 
in  a  SETAC  Special  Publication  by  Lewis  Publishers  in  1994. 

•  Life- Cycle  Assessment  Data  Quality  Workshop.  Held  in  Wintergreen,  Virginia, 
October  4-9, 1992.  Proceedings  published  by  SETAC  in  1994. 

•  Avian  Radio  Telemetry  in  Support  of  Pesticide  Field  Studies.  Held  in  Pacific 
Grove,  California,  January  5-8, 1993.  Proceedings  to  be  published  by  SETAC 
in  1997. 

•  Sustainability-Based  Environmental  Management.  Held  in  Pellston,  Michigan, 
August  25-31, 1993.  Co-sponsored  by  the  Ecological  Society  of  America. 
Proceedings  to  be  published  by  SETAC  in  1997. 

•  Environmental  Risk  Assessment  for  Organochlorine  Compounds.  Held  in  Alliston, 
Ontario,  Canada,  July  25-29, 1994.  Proceedings  to  be  published  by  SETAC  in 
1997. 

•  Application  of  Life- Cycle  Assessment  to  Public  Policy.  Held  in  Wintergreen, 
Virginia,  August  14-19, 1994.  Proceedings  to  be  published  by  SETAC  in  1997. 

•  Ecological  Risk  Assessment  Modeling  Systems.  Held  in  Pellston,  Michigan,  August 
23-28, 1994.  Proceedings  to  be  published  by  SETAC  in  1997. 

•  Avian  Toxicity  Testing.  Held  in  Pensacola,  Florida,  December  4-7, 1994.  Co¬ 
sponsored  by  OECD.  Proceedings  published  by  OECD  in  1997. 

•  Chemical  Ranking  and  Scoring.  Held  in  Sandestin,  Florida,  February  12-16, 1995. 
Proceedings  to  be  published  by  SETAC  in  1997. 

•  Sediments  Risk  Assessment.  Held  in  Pacific  Grove,  California,  April  23-28, 1995. 
Proceedings  published  by  SETAC  in  1997. 

•  Ecotoxicology  and  Risk  Assessment  for  Wetlands.  Held  in  Gregson,  Montana,  July 
30-August  3, 1995.  Proceedings  to  be  published  by  SETAC  in  1997. 

•  Uncertainty  in  Ecological  Risk  Assessment.  Held  in  Pellston,  Michigan,  August  2 3- 
28, 1995.  Proceedings  to  be  published  by  SETAC  in  1997. 

•  Whole  Effluent  Toxicity  Testing:  An  Evaluation  of  Methods  and  Prediction  of 
Receiving  System  Impacts.  Held  in  Pellston,  Michigan,  September  16-21, 1995. 
Proceedings  published  by  SETAC  in  1996. 
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•  Reassessment  of  Metals  Criteria  for  Aquatic  Life  Protection:  Priorities  for  Research 
and  Implementation,  Held  in  Pensacola,  Florida,  February  10-14, 1996.  Proceed¬ 
ings  published  by  SETAC  in  1997. 

Information  about  the  availability  of  workshop  reports  can  be  obtained  by  contacting: 

Society  of  Environmental  Toxicology  and  Chemistry  (SETAC) 

1010  North  12th  Avenue 
Pensacola,  FL  32501-3370 
U.S.A. 

T  904  469  1500  F  904  469  9778 
E  setac@setac.org 
http://www.setac.org 
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Christopher  G.  Ingersoll,  Ph.D.,  is  presently  an  aquatic  toxicologist  with  the  U.S.  Geo¬ 
logical  Survey  at  the  Midwest  Science  Center  in  Columbia,  Missouri.  He  received  his 
bachelors’  (1978)  and  masters’  (1982)  degrees  from  Miami  University  in  Oxford,  Ohio, 
and  his  doctorate  (1986)  from  the  University  of  Wyoming  in  Laramie.  Since  1986,  his 
research  has  focused  on  developing  and  applying  methods  for  investigating  the  bioavail¬ 
ability  of  contaminants  in  sediment.  He  has  coordinated  the  development  of  chronic 
toxicity  tests  that  have  been  used  to  evaluate  contaminated  sediments  in  several  locations 
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Executive  Summary 

Tom  Dillon,  Gregory  R.  Biddinger,  Christopher  G.  Ingersoll 


Overview 

Sediments  are  both  a  source  and  a  sink  for  persistent  contaminants  entering  the  aquatic 
environment.  Sediment  quality  assessment  methods  involving  sediment  toxicity  and 
bioaccumulation  testing  began  in  the  early  1980s  at  about  the  same  time  as  did  environ¬ 
mental  risk  assessment  (ERA)  procedures,  and  the  two  processes  have  evolved  in  parallel 
but  separately.  To  date,  sediment  assessment  procedures  have  not  been  formally  inte¬ 
grated  into  a  single  process.  The  purposes  of  the  SETAC  Sediment  Ecological  Risk 
Assessment  Workshop,  held  23—28  April  1995  in  Pacific  Grove,  California,  were  to  pro¬ 
vide 

1)  a  framework  for  integrating  sediment  quality  assessment  with  ERA, 

2)  guidance  for  three  specific  applications  (product  assessment,  dredging,  and  site 
cleanups),  and 

3)  a  list  of  actions  that  can  be  taken  to  advance  the  application  of  ERA  to  contami¬ 
nated  sediments. 

Steering  Committee  and  Workgroup  Organization 

The  Steering  Committee  (Table  1)  comprised  nine  individuals  and  a  representative  from 
the  SETAC  Foundation.  Membership  on  the  Steering  Committee  was  based  on 

1)  knowledge  of  and  experience  with  sediments  and  risk  assessment  procedures, 

2)  experience  with  as  broad  a  range  as  possible  of  sediment  applications, 

3)  knowledge  of  individuals  working  in  this  and  related  fields  (the  Steering 
Committee  chose  the  remaining  participants),  and 

4)  willingness  and  proven  ability  to  perform  organizational  tasks  before,  during, 
and  after  the  workshop. 

Participants  (Table  1)  were  chosen  by  the  Steering  Committee  based  on  requirements  for 
stakeholder  representation  and  international  expertise  in  all  areas  of  ERA  and  sediment 
quality  assessment.  The  workshop  was  organized  into  applications  and  issues  work¬ 
groups.  The  applications  workgroups  were  Product  Safety  Assessment  (Chapter  4), 
Navigational  Dredging  (Chapter  5),  and  Contaminated  Site  Cleanup  (Chapter  7).  The 
issues  workgroups  were  Ecological  Relevance  (Chapter  12)  and  Methodological  Uncer¬ 
tainty  (Chapters  17  and  18).  The  first  full  day  of  the  workshop  comprised  plenary 
presentations,  beginning  with  an  overview  of  ERA  (Chapter  1)  and  proceeding  through 
the  applications  and  issues  papers.  Three  subsequent  days  were  spent  primarily  in  work¬ 
groups.  At  the  end  of  the  second  day,  workgroups  provided  draft  outlines  of  their 
prospective  chapters,  which  subsequently  were  discussed  in  a  plenary  session.  The  third 
and  fourth  days  were  spent  in  workgroups,  with  brief  morning  plenaries  during  which 
the  Steering  Committee  provided  direction  and  clarification.  The  plenaries  also  helped 
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Exxon  Company  USA,  Benicia,  California 
EA  Engineering,  Hunt  Valley,  Maryland 
Kennecott  Corporation,  Salt  Lake  City,  Utah 
Wright  State  University,  Dayton,  Ohio 
EVS  Environmental  Consultants,  Vancouver,  British  Columbia 
Environment  Canada,  Burlington,  Ontario 

National  Oceanic  and  Atmospheric  Administration,  Seattle, 
Washington 

U.S.  Geological  Survey,  Columbia,  Missouri 
U.S.  Environmental  Protection  Agency,  Washington,  DC 
SETAC  Foundation,  Pensacola,  Florida 

Product  Safety  Assessment  Workgroup 

Procter  and  Gamble  Company,  Cincinnati,  Ohio 
Kennecott  Corporation,  Salt  Lake  City,  Utah 
Du  Pont  Agricultural  Products,  Wilmington,  Delaware 
Water  Research  Center,  United  Kingdom 
Monsanto,  St.  Louis,  Missouri 
Sierra  Club,  Madison,  Wisconsin 
Clemson  University,  Pendleton,  South  Carolina 
U.S.  Environmental  Protection  Agency,  Washington,  DC 
SETAC,  Pensacola,  Florida 

Navigational  Dredging  Workgroup  - 

Richard  Peddicord  (chair)  EA  Engineering,  Hunt  Valley,  Maryland 

Tom  Chase  U.S.  Environmental  Protection  Agency,  Washington,  DC 

Tom  Dillon  EA  Engineering,  Hunt  Valley,  Maryland 

Jim  McGrath  Port  of  Oakland,  Oakland,  California 

Wayne  R.  Munns  U.S.  Environmental  Protection  Agency,  Narragansett,  Rhode  Island 

Kees  van  de  Gucthe  Ministry  of  Transport,  Public  Works  and  Water,  The  Netherlands 

William  van  der  Schalie  U.S.  Environmental  Protection  Agency,  Washington,  DC 


Charles  A.  Pittinger  (chair) 
William  J.  Adams 
Joseph].  Dulka 
Rachel  Fleming 
Rich  Kimerle 
Patricia  E.  King 
Thomas  W.  La  Point 
Anthony  F.  Maciorowski 
Gregory  Schiefer 


Gregory  R.  Biddinger  (chair) 
Tom  Dillon  (chair) 

William  J.  Adams 
G.  Allen  Burton 
Peter  M.  Chapman 
Kristin  E.  Day 
Alyce  T.  Fritz 

Christopher  G.  Ingersoll 
William  van  der  Schalie 
Gregory  Schiefer 
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EVS  Environmental  Consultants,  Vancouver,  British,  Columbia 

Shell  Development  Company,  Houston,  Texas 

National  Oceanic  and  Atmospheric  Administration,  Seattle, 
Washington 

Environment  Canada,  Hull,  Quebec 
Menzie-Cura  and  Associates,  Chelmsford,  Massachusetts 
U.S.  Environmental  Protection  Agency,  Edison,  New  Jersey 

ENSR  Consulting  and  Engineering,  Ft.  Collins,  Colorado 

Ecological  Relevance  Workgroup 

Battelle  Pacific  Northwest  Laboratory,  Sequim,  Washington 

Exxon  Company  USA,  Benicia,  California 

Colorado  State  University,  Ft.  Collins,  Colorado 

Environment  Canada,  Burlington,  Ontario 

University  of  Western  Ontario,  London,  Ontario 

Western  Washington  University,  Bellingham,  Washington 

National  Oceanic  and  Atmospheric  Administration,  Ann  Arbor, 
Michigan 

National  Institute  for  Water  and  Atmospheric  Research,  Hamilton, 
New  Zealand 

U.S.  Environmental  Protection  Agency,  Washington  DC 
Department  of  Fisheries  and  Oceans,  Winnipeg,  Manitoba 
Oak  Ridge  National  Laboratory,  Oak  Ridge,  Tennessee 

Methodological  Uncertainty  Workgroup 

Keith  R.  Solomon  (chair)  University  of  Guelph,  Guelph,  Ontario 

Gerald  T.  Ankley  U.S.  Environmental  Protection  Agency,  Duluth,  Minnesota 

Renato  Baudo  Institute  Italiano  Idrobioligia,  Pallanza,  Italy 

G.  Allen  Burton  Wright  State  University,  Dayton,  Ohio 

Christopher  G.  Ingersoll  U.S.  Geological  Survey,  Columbia,  Missouri 

Wibert  Lick  University  of  California-Santa  Barbara,  Santa  Barbara,  California 

Samuel  N.  Luoma  U.S.  Geological  Survey,  Menlo  Park,  California 

Donald  D.  MacDonald  MacDonald  Environmental  Sciences,  Ladysmith,  British  Columbia 

Trefor  B.  Reynoldson  Environment  Canada,  Burlington,  Ontario 

Richard  C.  Swartz  U.S.  Environmental  Protection  Agency,  Newport,  Oregon 
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identify  and  promote  discussion  of  crosscutting  issues  and  provided  a  venue  for  progress 
reports.  The  final  day  was  spent  in  plenary,  with  each  workgroup  presenting  and  discuss¬ 
ing  the  results  of  their  deliberations.  Before  participants  left  the  workshop,  draft  products 
were  provided  on  diskette  and  hard  copy  and  were  subsequently  finalized  by  mail,  e-mail, 
and  fax.  Editing  by  the  Steering  Committee,  peer  review  of  the  invited  papers,  and  writ¬ 
ing  of  the  Executive  Summary  followed  the  completion  of  the  workshop. 

Workshop  Objective  and  Charges  to  Participants 

The  Pellston  Sediment  Ecological  Risk  Assessment  Workshop  brought  together  two  dis¬ 
ciplines  that  have  evolved  more  or  less  separately:  sediment  quality  assessment  and 
ecological  risk  assessment.  The  intent  was  to  provide  a  forum  for  synthesis  and  synergy 
among  workshop  participants  from  the  two  disciplines.  The  ultimate  goal  was  to  advance 
the  state-of-the-art  and  the  state-of-the-practice  of  sediment  assessment  by  incorporating 
appropriate  risk-based  principles  and  practices  and  by  attempting  to  balance  environ¬ 
mental  risks  with  the  cost  of  environmental  protection.  Although  concerns  regarding 
human  health  were  discussed  at  the  workshop,  the  primary  focus  was  on  procedures  for 
conducting  sediment  ecological  risk  assessments  (SERAs). 

All  participants  were  given  the  charge  to  address  the  following  issue: 

•  Critically  analyze  existing  approaches  for  evaluating  sediment  quality,  and 
identify  principles  and  practices  of  SERA  that  will  improve  the  scientific  and 
regulatory  communities’  ability  to  assess  environmental  risks  associated  with 
contaminated  sediments. 

Participants  also  were  asked  to  address  the  following  subsidiary  charges  as  appropriate: 

•  Identify  appropriate  and  ecologically  relevant  assessment  and  measurement 
endpoints. 

•  Identify  technically  sound  models  and  appropriate  extrapolations. 

•  Identify  significant  areas  of  uncertainty. 

•  Recommend  research  and  development  needs,  especially  those  reducing 
significant  uncertainties. 

•  Recommend  strategic  modifications  to  improve  sediment  quality  assessments 
through  the  use  of  risk-based  approaches. 

The  issues  workgroups  were  specifically  charged  to  critically  examine  and  identify  where, 
in  the  risk  assessment  process,  methodological  uncertainty  and  ecological  relevance  most 
influence  the  outcome. 

The  applications  workgroups  were  specifically  charged  to  develop  a  generic  risk  assess¬ 
ment  process  appropriate  to  a  specific  application  area  for  evaluating  environmental 
impacts. 

Workshop  summary 

The  workshop  was  held  in  Pacific  Grove,  California,  23-28  April  1995.  Participants  were 
assigned  to  one  of  five  working  groups,  three  devoted  to  specific  applications  and  two  to 
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special  issues.  The  applications  workgroups  were  Navigational  Dredging,  Product  Safety 
Assessment,  and  Contaminated  Site  Cleanup.  Special  issue  workgroups  were  Ecological 
Relevance  and  Methodological  Uncertainty.  Workshop  sessions  alternated  between  ple¬ 
nary  and  individual  workgroup  meetings. 

Findings  of  the  individual  workgroups  are  reported  in  the  chapters  that  follow.  This  sum¬ 
mary  describes  major  crosscutting  issues  as  well  as  important  findings  from  individual 
workgroups.  The  first  issue  to  emerge  was  a  clear  consensus  that  the  current  ERA  para- 
digm  (c.g.,  U.S.  Environmental  Protection  Agency  [USEPA]  1992)  is  appropriate  for 
assessing  sediment  quality  (Chapter  1).  It  is  a  very  useful  guide  to  organizing  issues  and 
identifying  site-specific  data  gaps.  This  is  an  important  and  highly  desirable  characteris¬ 
tic  because  SERAs  vary  so  greatly  in  scope,  content,  and  purpose. 

The  second  major  issue  to  emerge  was  the  interaction  between  risk  assessor  and  risk 
manager.  There  is  longstanding  precedent  in  human  health  risk  assessments  (HHRAs) 
and  ERAS  to  separate  risk  assessment  from  risk  management.  This  is  done  for  the  correct 
and  commendable  goal  of  ensuring  scientific  integrity  in  the  risk  assessment  process. 
However,  this  separation  has,  in  practice,  too  often  meant  a  lack  of  communication  be¬ 
tween  the  risk  assessor  and  risk  manager.  As  a  result,  risk  assessments  too  often  do  not 
meet  the  needs  of  the  risk  manager  (Chapter  2).  Most  workshop  participants  agreed  that 
an  active  risk  assessor-risk  manager  dialogue  is  important  but  were  unsure  as  to  the 
structure  of  such  a  dialogue  or  where  respective  roles  and  responsibilities  overlapped.  In 
the  opinion  of  many  participants,  this  topic  has  sufficient  merit  to  justify  its  own  work¬ 
shop. 

The  third  major  crosscutting  issue  was  the  development  of  and  relationship  between 
assessment  and  measurement  endpoints  for  SERAs.  Assessment  endpoints  are  highly  val¬ 
ued  characteristics  of  the  site  or  system  that  should  be  protected,  restored,  or 
remediated.  These  valued  characteristics  reflect  ecological  concerns  as  well  as  social  and 
political  issues.  Measurement  endpoints  are  specific  observations  reflecting  change  in  the 
valued  characteristic  of  the  assessment  endpoint.  During  SERAs,  data  from  multiple 
measurement  endpoints  are  gathered  to  assess  change  in  the  assessment  endpoint.  The 
assessment  endpoint  is  related  to  the  measurement  endpoints  through  exposure  (path¬ 
ways  and  routes)  as  well  as  through  chemical-specific  mechanisms  of  effect. 

Another  major  theme  to  emerge  from  the  workshop  was  the  recommendation  to  use  a 
weight-of-evidence  approach  in  SERAs.  In  a  weight-of-evidence  approach,  multiple  lines 
of  evidence  are  generated  to  support  decision-making.  The  implicit  corollary  is  that  no 
single  line  of  evidence  should  drive  decision-making  in  the  weight-of-evidence  approach. 
Some  lines  of  evidence  (measurement  endpoints)  can  be  “weighted,”  that  is,  valued  more 
highly  than  others.  For  example,  data  from  American  Society  for  Testing  and  Materials 
(ASTM)-standardized  sediment  bioassays  may  receive  greater  emphasis  than  would  re¬ 
sults  from  bioassays  still  under  development.  Likewise,  an  exposure  model  validated  in 
the  field  would  be  more  highly  valued  than  one  that  was  not.  While  workshop  partici¬ 
pants  expressed  a  clear  preference  for  the  weight-of-evidence  approach,  there  was  no 
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consensus  as  to  how  the  approach  would  incorporate  uncertainties  or  relate  assessment 
and  measurement  endpoints.  This  is  an  important  gap  because  the  multiple  lines  of  evi¬ 
dence  (measurement  endpoints)  generated  during  the  SERA  must  all  be  relational  to  the 
assessment  endpoint. 

Using  a  tiered  approach  in  SERAs  was  also  a  strong  consensus  theme  at  the  workshop. 
This  recommendation  has  consistently  emerged  at  previous  Pellston  Workshops  (see  the 
list  of  publications  from  Pellston  Workshops  in  the  Foreword).  In  a  tiered  approach,  in¬ 
creasingly  complex  and  usually  more  costly,  time-consuming  evaluations  are  undertaken 
only  as  required  to  quantify  and  reduce  uncertainties  associated  with  risk  estimates.  One 
proceeds  through  the  tiers  until  sufficient  information  with  an  acceptable  level  of  uncer¬ 
tainty  exists  to  make  risk  management  decisions.  For  sediments,  one  of  the  best  working 
examples  of  a  tiered  assessment  is  the  U.S.  Environmental  Protection  Agency/U.S.  Army 
Corps  of  Engineers  (USAGE)  four-tiered  system  for  evaluating  dredged  material  disposal 
in  the  aquatic  environment  (USEPA  and  USAGE  1991, 1994).  Although  not  currently  risk- 
based,  the  USEPA/USACE  tiered  assessment  has  been  used  for  many  years  to  successfully 
evaluate  diverse  sediments  under  a  range  of  disposal  conditions  (Chapter  5). 

Discussions  at  the  workshop  suggested  that  the  most  pressing  technical  advancements 
are  needed  in  the  area  of  exposure  assessment.  All  workgroups  cited  significant  data  gaps 
and  modeling  deficiencies  for  predicting  spatial-temporal  distributions  of  sediments  and 
sediment-associated  contaminants.  Sediment  transport  models  have  been  developed  and 
field  validated  only  for  coarse-grain  sandy  material.  Similar  models  for  fine-grain  sedi¬ 
ments  (where  most  contaminants  tend  to  reside)  lag  far  behind.  Models  predicting  the 
fate  of  sediment-associated  chemicals,  for  the  most  part,  are  not  field  validated  and  do 
not  consider  kinetics  they  assume  chemical  equilibrium).  Having  access  to  predictive 
exposure  models  is  especially  critical  to  risk  managers  because  sediment  risks  are  man¬ 
aged  by  reducing  or  eliminating  exposure,  not  by  altering  toxicity.  Thus,  the  paucity  of 
well-developed,  predictive  exposure  models  hampers  our  ability  to  manage  sediment 
risks. 

What  constitutes  a  suitable  “reference”  for  SERAs  was  an  important  issue  for  several 
workgroups.  Unlike  HHRA,  where  numerical  frames  of  reference  exist  {i.e,,  10  to  10  ® 
excess  cancer  risks),  ERA  lacks  corresponding  guidance.  For  sediment  assessments,  com¬ 
parison  to  a  reference  provides  the  primary  basis  for  data  interpretation  (Ghapter  5).  The 
“reference”  can  take  the  form  of  a  reference  sediment,  a  reference  benthic  community,  or 
a  toxicity  reference  value.  Several  workgroups  concluded  that  the  ecological  risk  of  sedi¬ 
ments  should  be  evaluated  not  in  the  absolute  sense  but  in  the  context  of  the  receiving 
environment.  This  strongly  suggests  that  the  sediment  reference  should  be  based  on 
characteristics  of  the  receiving  ecosystem. 

The  relationship  between  goals  of  the  sediment  risk  assessment  and  statistical  design  was 
discussed  in  the  context  of  uncertainty  analysis.  Risk  assessments  are  typically  structured 
to  be  environmentally  conservative.  That  is,  highly  conservative  screening  values,  calcu¬ 
lations,  and  model  assumptions  are  used  throughout.  This  minimizes  the  chance  of  false 
negative  errors  (concluding  there  is  no  problem,  when  in  fact  one  exists).  Statistical  de- 
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signs,  however,  usually  focus  on  the  probability  of  making  a  false  positive  error  (conclud¬ 
ing  there  is  a  problem,  when  in  fact  one  does  not  exist)  by  establishing  a  low  a  level. 
Future  statistical  designs  should  also  focus  on  addressing  the  goal  of  the  risk  assessment 
to  minimize  false  negative  errors  (Chapter  18). 

The  remainder  of  this  summary  provides  a  historical  perspective  vis  a  vis  another  Pellston 
Workshop  on  sediments  (“Historical  perspective:  comparison  to  Pellston  VI  [1984]”)  and 
major  SERA  research  and  development  needs  emerging  from  this  workshop  (“Priority 
research  and  development  needs”). 

Historical  perspective:  comparison  to  Pellston  VI  (1984) 

About  ten  years  ago,  another  SETAC  workshop  focused  on  sediment-associated  chemi¬ 
cals.  It  was  the  sixth  Pellston  Workshop  held  in  Florissant,  Colorado,  in  1984  (Dickson  et 
al  1987).  The  five  preceding  Pellston  Workshops  had  dealt  with  the  hazard  assessment 
process  and  how  chemicals  and  effluents  might  impact  the  aquatic  environment.  The 
emphasis  was  generally  on  dissolved  chemicals  and  water  column  impacts.  Pellston  VI 
represented  the  increasing  realization  that  1)  many  chemicals  released  to  the  aquatic 
environment  ultimately  become  associated  with  bottom  sediments,  and  2)  the  assump¬ 
tion  of  irreversibility  of  chemicals  sorbed  to  sediments  was  wrong.  The  scientific  and 
regulatory  communities  began  to  redirect  their  attention  away  from  the  water  column 
toward  sediment-associated  chemicals  and  benthic  community  impacts.  The  terms  bio¬ 
availability,  partitioning,  and  bioaccumulation  came  into  more  common  and  widespread 
use. 

Because  the  1995  Pellston  SERA  Workshop  in  Pacific  Grove  dealt  with  the  ecological  risk 
of  sediments,  we  thought  it  would  be  insightful  to  draw  a  historical  perspective  with  Pell¬ 
ston  VI  by  examining  its  major  technical  conclusions  of  ten  years  ago.  Those  conclusions 
from  Pellston  VI  are  restated  below  as  they  originally  appeared. 

a)  Determining  exposure  concentrations  of  sediment-associated  chemicals  is 
essential  for  assessing  the  impact  of  contaminated  sediments  in  the  aquatic 
environment. 

b)  Equilibrium  partitioning  models  provide  an  estimate  of  the  maximum  amount 
of  sorbed  material  that  is  bioavailable. 

c)  A  hydrophobic  sorption  model:  Bioavailability  of  hydrophobic  solutes  is 
dependent  on  organic  carbon  content. 

d)  A  means  of  modeling  metal  sorption  to  sediment  has  been  proposed. 

e)  Sorption  models  must  be  fully  assessed  in  the  laboratory  before  field  testing. 

f )  The  relationship  between  fate,  distribution,  and  bioavailability  of  sediment- 
associated  chemicals,  and  the  oxidized  state  of  sediment  should  be  thoroughly 
investigated. 

g)  Test  methods  and  appropriate  organisms  should  be  recommended  for  assessing 
the  toxicity  of  chemicals  that  sorb  to  sediments. 
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h)  Bioaccumulation  of  sediment-associated  contaminants  by  aquatic  organisms  is 
not  presently  an  effective  measure  of  adverse  ecological  impact  because  of  the 
paucity  of  residue-effects  correlations. 

i)  Toxic  chemicals  that  sorb  to  sediment  are  a  potential  hazard  to  aquatic  systems, 

j)  At  this  time,  it  is  not  feasible  to  develop  numerical  sediment  quality  criteria. 

k)  An  approach  to  assess  the  hazard  of  sediment-bound  chemicals  is  needed. 

The  first  six  conclusions  (a  to  f )  involve  exposure  assessment  issues.  This  demonstrates 
that  ten  years  ago,  like  today,  exposure  was  a  central  issue  in  assessing  the  potential  im¬ 
pacts  of  sediment-associated  chemicals.  A  major  contribution  that  emerged  from 
Pellston  VI  was  the  concept  of  equilibrium  partitioning  (EqP).  The  workshop  partici¬ 
pants  developed  a  consensus  partitioning  model  for  nonpolar  organic  chemicals  using 
EqP.  Since  that  time,  numerous  laboratory  and  field  investigations  have  examined  the 
validity  of  EqP  for  nonpolar  organics  under  equilibrium  conditions.  A  sorption  EqP- 
based  model  for  metals  was  also  proposed  at  Pellston  VI.  However,  since  that  time,  the 
importance  of  acid  volatile  sulfides  (AVS)  on  metal  bioavailability  has  been  more  widely 
recognized  and  intensely  studied  {e.g. ,  Di  Toro  et  al.  1990). 

Two  conclusions  of  Pellston  VI  (g,  h)  dealt  with  effects  assessment.  “Test  methods  and 
appropriate  organisms”  have  been  the  focus  of  considerable  research  and  development 
since  that  time,  and  as  a  result,  sediment  bioassays  for  both  freshwater  and  saltwater 
sediments  are  now  available.  Most  tests  measure  survival  following  short-term  exposures 
to  bedded  sediment.  Fewer  address  sublethal  impacts  (e.g,  growth,  reproduction)  follow¬ 
ing  chronic  exposures.  Continued  development  of  more  sensitive  and  ecologically 
relevant  endpoints  (e.g.,  chronic  effects  on  growth,  reproduction,  and  population  end¬ 
points)  has  the  potential  to  produce  better,  more  ecologically  relevant  measurement 
endpoints  for  sediment  risk  assessments.  The  “paucity  of  residue-effects  correlations” 
noted  at  the  Pellston  VI  Workshop  still  exists  today. 

The  last  three  conclusions  (i,  j,  k)  focus  on  the  question  of  how  to  assess  sediments  con¬ 
taining  anthropogenic  chemicals.  At  the  time  of  Pellston  VI,  workshop  participants  felt 
it  would  not  be  feasible  to  develop  numerical  sediment  quality  criteria  (SQC).  They  cited 
too  many  uncertainties  in  methodology  and  the  incomplete  validation  of  supporting 
theories.  They  concluded  that  sediment  assessments  should  be  made  on  a  case-by-case 
basis  because  the  physical  and  chemical  characteristics  of  each  sediment  are  practically 
unique.  Development  of  numerical  SQC,  however,  proceeded  after  Pellston  VI,  and  today 
a  number  of  chemical-specific  sediment  quality  values  exist  to  evaluate  sediments  (Chap¬ 
ter  18).  These  chemical  values  should  be  used  as  one  of  the  several  measures  in  the 
weight-of-evidence  approach  to  sediment  risk  assessments.  Finally,  the  Pellston  VI  par¬ 
ticipants  cited  the  need  for  an  “approach  to  assess  the  hazard  of  sediment-bound 
chemicals.”  Until  the  1995  Pellston  SERA  Workshop,  developing  such  an  approach  re¬ 
ceived  little  formal  attention.  Results  of  the  1995  workshop  suggest  that  a  generic 
approach  now  exists,  that  it  can  be  adapted  for  diverse  applications,  but  that  much  re¬ 
search  and  development  remains  to  be  done. 
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Priority  research  and  development  needs 

Research  and  development  (R&D)  needs  generated  by  individual  workgroups  at  the 
workshop  are  reported  in  the  chapters  that  follow.  Important  crosscutting  R&D  issues 
are  discussed  in  the  paragraph  below;  specific  R&D  needs  are  listed  at  the  end  of  this  sec¬ 
tion.  Both  are  organized  per  the  major  elements  of  the  SERA  paradigm:  problem 
formulation,  exposure  assessment,  effects  assessment,  and  risk  characterization. 

Resolving  exposure  assessment  issues  appears  to  be  a  problematic,  recurring  theme  and 
therefore  should  receive  priority  in  any  R&D  effort.  Not  only  are  exposure  models  tech¬ 
nically  challenging  but  also,  when  properly  developed  and  validated,  they  permit  risk 
managers  to  assess  alternative  solutions.  This  is  because  it  is  exposure,  not  toxicity,  that 
is  managed  to  reduce  risk.  Problem  formulation  issues  (assessment/measurement  end¬ 
points,  risk  assessor-risk  manager  interface)  were  viewed  as  critical  but  easier  to  address 
from  an  R&D  perspective.  The  problem  formulation  issues  are  important  to  address  be¬ 
cause  they  define  the  scope  and  technical  direction  for  SERAs.  Further  development  of 
chronic  sublethal  sediment  bioassays  with  ecologically  relevant  endpoints  (survival, 
growth,  reproduction,  population-level  endpoints)  and  residue-effects  relationships  for 
persistent,  bioaccumulative  chemicals  were  cited  as  top  priorities  for  effects  assessment 
research.  Finally,  major  R&D  needs  for  risk  characterization  were  techniques  to  1)  quan¬ 
titatively  integrate  effects  and  exposure  data,  2)  combine  multiple  lines  of  evidence  in  a 
weight-of-evidence  approach,  and  3)  assess  and  communicate  uncertainties  associated 
with  estimates  of  ecological  risk. 

Problem  formulation 

The  major  R&D  needs  associated  with  problem  formulation  are  these: 

•  Develop  guidance  for  selecting  ecologically  relevant  assessment  and  measure¬ 
ment  endpoints. 

•  Develop  a  weight-of-evidence  approach  for  linking  assessment  and  measure¬ 
ment  endpoints.  The  approach  should  be  consistent  yet  flexible  enough  for  a 
wide  range  of  SERAs. 

•  Develop  guidance  for  selecting  what  is  an  appropriate  reference  (i.e. ,  reference 
sediments,  reference  areas,  reference  toxicity  values,  reference  benthic  commu¬ 
nities). 

•  More  fully  develop  the  concept  of  the  risk  assessor-risk  manager  interface.  Help 
define  roles  and  responsibilities  for  scientists,  managers,  stakeholders,  and  the 
public. 

Exposure  assessment 

The  major  R&D  needs  associated  with  exposure  assessment  are  these: 

•  Develop  and  field-validate  exposure  models  for  predicting  space-time  distribu¬ 
tions  of  a)  fine-grain  sediments  and  b)  sediment-associated  chemicals  in  the 
food  web. 
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•  Develop  techniques  to  ensure  exposure  model  outputs  are  in  units  consistent 
with  effects  assessments  data  {e.g.,  mg  contaminant/kg  sediment,  mg  sus¬ 
pended  sediment/liter,  proportion  of  contaminated  bedded  sediment). 

•  Develop  predictive  exposure  models  for  metals  and  ionic  chemicals  sorbed  to 
sediments.  Field-validate  existing  models  for  neutral,  hydrophobic  organic 
chemicals. 

•  Develop  exposure  models/techniques  to  address  complex  mixtures  of  chemicals 
embedded  in  the  sediment  matrix. 

•  Develop  quantitative  tissue  residue-biological  effects  relationships  for  persis¬ 
tent  bioaccumulative  chemicals. 

Effects  assessment 

The  major  R&D  needs  associated  with  effects  assessment  are  these: 

•  Pursue  further  development  of  chronic  sediment  toxicity  tests  that  measure 
survival,  growth,  reproduction,  and  population-level  endpoints. 

•  Develop  technically  sound  interpretive  guidance  for  effects  assessment  tools 
{e.g.,  individual-to-population  and  lab-to-field  extrapolations). 

•  Develop  techniques  for  simulating  field  exposures  in  laboratory  sediment 
toxicity  tests  {e.g.,  time-variant  suspended  sediment  exposures,  multiple 
disposal  events,  field  gradients  representing  dilution  with  other  sediments). 

•  Develop  quantitative  tissue  residue-biological  effects  relationships  for  persis¬ 
tent  bioaccumulative  chemicals. 

•  Refine  approaches  for  selecting  what  constitutes  an  appropriate  reference  or 
range  of  reference  values. 

Risk  characterization 

The  major  R&D  needs  associated  with  risk  characterization  are  these: 

•  Develop  quantitative  techniques  to  integrate  effects  assessment  {e.g.,  exposure- 
response  relationships)  and  exposure  assessment  data  {e.g.,  field  gradients). 

•  Establish  qualitative  and  quantitative  uncertainty  analysis  procedures,  includ¬ 
ing  the  appropriate  use  of  uncertainty  factors,  the  integration  of  uncertainty 
from  multiple  lines  of  evidence  {e.g.,  toxicity,  chemistry,  benthic  analysis),  and 
the  identification/quantification  of  false  negative  and  false  positive  errors. 

•  Evaluate  the  impact  of  extrapolations  {e.g.,  lab-to-field,  species-to-species, 
response-to-response)  on  estimates  of  ecological  risk. 

•  Adapt  toxicity  identification  evaluation  (TIE)  procedures  currently  used  to 
evaluate  effluents  and  sediment  pore  water  for  whole  sediment  toxicity  tests. 

•  Develop  methods  to  assess  recovery  potential  following  exposure  to  sediment- 
associated  chemicals. 

•  Use  carefully  designed  field  studies  to  evaluate  the  predictive  ability  of  sediment 
quality  guidelines  (SQG),  including  the  potential  for  generating  false  negative 
and  false  positive  errors. 
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SESSION  1 

PLENARY  OVERVIEW  OF 

SEDIMENT  ECOLOGICAL  RISK  ASSESSMENT 

- Chapter  1 

Overview  of  the  ecological  risk 
assessment  framework 

Glenn  W.  Suterll 


1.1  Introduction 

Ecological  risk  assessment  (ERA)  is  the  estimation  of  the  likeUhood  of  undesired  effects  of 
human  actions  or  natural  events  and  the  accompanying  risks  to  nonhuman  organisms, 
populations,  and  ecosystems.  Ecological  risk  assessment  began  with  efforts  to  apply  the 
concepts  and  rigor  of  human  health,  engineering,  and  financial  risk  assessment  to  eco¬ 
logical  hazards.  Those  efforts  led  to  the  development  of  a  consensus  standard  framework 
for  ERA  by  the  National  Research  Council  (NRC),  U.S.  Environmental  Protection  Agency 
(USEPA),  and  others  (Barnthouse  and  Suter  1986;  USEPA 1992;  Suter  1993).  Ecological 
risk  assessment  differs  from  impact  assessment,  hazard  assessment,  and  other  environ¬ 
mental  assessment  techniques  in  the  following  points: 

•  It  has  a  standard  logical  structure. 

•  It  separates  assessment  from  management. 

•  It  has  clearly  defined  endpoints. 

•  It  explicitly  recognizes  the  role  of  uncertainty  in  decision-making. 

1.2  Standard  logical  structure 

Ecological  risk  assessment  is  characterized  by  a  standard  logical  structure  or  paradigm 
(Barnthouse  and  Suter  1986;  USEPA  1992;  Suter  1993).  This  structure  is  derived  from  a 
paradigm  for  human  health  risk  assessment  (HHRA)  but  has  been  modified  to  accommo¬ 
date  differences  between  ecological  systems  and  humans  (Figure  1-1).  The  principal  one 
is  that,  unlike  HHRA,  which  begins  by  identifying  the  hazard  {e.g,  the  chemical  is  a  car¬ 
cinogen),  ERA  begins  by  dealing  with  the  diversity  of  entities  and  responses  that  may  be 
affected,  of  interactions  and  secondary  effects  that  may  occur,  of  scales  at  which  effects 
may  be  considered,  and  of  modes  of  exposure.  These  issues  are  dealt  with  by  combining 
them  into  a  conceptual  model  of  the  relationships  among  sources,  agents,  transport  pro¬ 
cesses,  and  receptors. 

The  second  component  of  the  ecorisk  paradigm  involves  parallel  characterizations  of 
ecological  effects  and  exposure.  The  need  to  consider  both  the  magnitude  of  exposure 
and  the  effects  associated  with  varying  levels  of  exposure  may  seem  self-evident,  but  it 
precludes  some  commonly  employed  regulatory  approaches.  The  requirement  that  ef¬ 
fects  be  considered  precludes  using  exceedance  of  background  or  detection  limits  as  a 
criterion  for  action.  Similarly,  the  requirement  that  exposure  be  considered  precludes 
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banning  chemicals  simply  because  they  have  certain  toxicological  properties  such  as  car¬ 
cinogenicity  or  teratogenicity. 

The  third  component  of  an  ERA  is  risk  characterization.  It  includes  combining  exposure 
and  effects  information  to  estimate  the  magnitude  of  realized  effects,  to  estimate  the  un¬ 
certainties,  and  to  interpret  the  risks. 

1.3  Risk  assessors  and  risk  managers 

Probably  the  most  contentious  issue  in  risk  assessment  is  the  proper  relationship  of  risk 
assessors  to  risk  managers,  who  decide  what  actions  to  take  in  response  to  risks.  The 
conflict  arises  from  two  contravening  issues.  First,  decision-makers  should  not  have  the 
opportunity  to  manipulate  the  data  so  that  they  support  a  desired  decision.  This  concern 
is  an  argument  for  keeping  the  risk  manager  out  of  the  process  until  the  assessors  hand 
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him  or  her  the  results.  Second,  if  risk  assessors  are  given  free  rein,  they  may  provide  an 
unbiased  answer  to  the  wrong  question  or  may  introduce  their  own  biases.  This  concern 
is  an  argument  for  making  the  risk  manager  a  participant  in  the  risk  assessment  to  ensure 
that  it  is  relevant  and  that  any  value  judgments  are  those  of  the  responsible  party.  The 
currently  favored  solution  to  this  problem  is  to  involve  the  risk  manager  in  the  problem 
formulation  and  then  keep  him  or  her  out  of  the  process  until  the  results  are  presented. 

Many  scientists  involved  in  ecotoxicology  and  applied  ecology  do  not  appreciate  the  role 
of  the  risk  manager.  They  view  the  ERA  process  as  one  in  which  scientists  decide  what  is 
important  to  assess  and  whether  significant  effects  are  occurring,  leaving  the  risk  man¬ 
ager  simply  to  decide  what  to  do  about  them.  However,  risk  managers,  not  scientists,  are 
designated  representatives  of  the  public.  Scientists  advise  and  educate  risk  managers,  but 
they  are  not  the  responsible  parties. 

1.4  Clearly  defined  endpoints 

One  of  the  characteristics  of  risk  assessment  is  that  endpoints  are  clearly  and  operation¬ 
ally  defined,  e.g.,  the  likelihood  of  cancer  or  the  frequency  of  crashes.  Ecotoxicologists 
have  difficulty  defining  equivalent  endpoints.  However,  if  ERA  is  to  be  rigorous  and  per¬ 
suasive,  it  is  necessary  to  define  exactly  what  it  is  that  the  assessment  is  attempting  to 
estimate  (Suter  1989, 1993;  USEPA 1992).  Vague  phrases  like  “ecosystem  health”  will  not 
do;  neither  will  clearly  defined  but  arcane  properties  like  levels  of  heat  shock  proteins. 
The  selection  of  assessment  endpoints  is  probably  the  most  important  and  difficult  as¬ 
pect  of  the  scientist’s  interaction  with  the  risk  manager  during  problem  formulation. 

Another  aspect  of  endpoint  definition  that  is  often  neglected  is  the  relationship  of  the 
effects  measures  to  the  assessment  endpoints.  In  some  cases,  the  relationship  is  one  of 
correspondence.  If  the  assessment  endpoint  is  the  percent  reduction  in  species  richness 
or  abundance  of  benthic  macroinvertebrates,  then  those  properties  are  measurable  in 
many  contaminated  sites.  However,  if  the  assessment  is  based  on  toxicity  testing,  the 
relationship  of  measurement  endpoints  to  assessment  endpoints  is  more  problematical. 
What  does  it  mean  to  benthic  macroinvertebrate  species  richness  or  abundance  that  a 
given  percentage  of  Hyalella  azteca  die  in  a  sediment  toxicity  test? 

1.5  Uncertainty 

The  concept  of  risk  implies  some  degree  of  uncertainty  concerning  actual  effects  of  an 
action.  The  frank  acceptance  of  uncertainty  is  different  from  conventional  science  and 
prior  ecotoxicological  assessment  paradigms.  Conventional  science  requires  that  the  in¬ 
vestigators  continue  to  perform  studies  until  they  can  demonstrate  with  very  high  confi¬ 
dence  ii.e.,  95%)  that  the  hypothesized  phenomenon  is  real.  The  hazard  assessment 
paradigm  similarly  requires  that  one  continue  to  do  more  and  more  complex  toxicity 
tests  and  fate  studies  until  one  is  confident  of  the  acceptability  of  a  chemical  release 
(Cairns  etal.  1979).  However,  risk  assessment  accepts  the  common  sense  proposition  that 
decisions  must  be  made  under  conditions  of  significant  uncertainty.  Therefore,  there  is 
often  a  nontrivial  risk  that  an  undesired  outcome  will  occur. 
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Estimating  uncertainties  is  a  difficult  problem,  but  the  real  trick  is  determining  which 
uncertainties  are  relevant  to  the  decision  and  presenting  those  uncertainties  in  a  compre¬ 
hensible  and  useful  manner.  Dealing  with  uncertainty  by  employing  conservative  as¬ 
sumptions  is  no  longer  acceptable  (NRC  1994).  It  is  incumbent  upon  us  to  estimate 
effects  and  uncertainties  separately.  That  is,  we  must  estimate  the  most  likely  outcome 
and  the  likelihood  of  more  or  less  severe  outcomes. 

1 .6  Variety  in  assessment  practices 

Within  the  ecorisk  formalism,  there  is  considerable  variety  in  actual  practice  relative  to 
other  types  of  risk  assessment.  This  is  in  part  because  of  the  relative  novelty  of  ERA  and 
the  lack  of  guidance  from  regulatory  agencies.  More  fundamentally,  this  novelty  is  due  to 
the  variety  of  ecological  receptors,  hazardous  agents  to  be  assessed,  and  mandates  for 
assessment. 

The  most  important  distinction  is  the  one  between  entirely  predictive  assessments,  such 
as  new  product  registration,  and  retrospective  assessments  that  address  existing  con¬ 
tamination.  Predictive  assessments  rely  on  laboratory  testing  and  modeling.  Retrospec¬ 
tive  assessments  not  only  use  conventional  laboratory  testing  and  modeling  but  also  can 
use  biological  field  surveys  and  toxicity  tests  of  contaminated  media. 

1.7  Inference  in  risk  assessment 

Conclusions  concerning  ecological  risks  are  based  on  weight  of  evidence.  The  need  for 
weighing  of  evidence  is  obvious  in  ERAs  for  contaminated  sites  where  information  may 
be  available  concerning  the  concentrations  of  contaminants  in  various  media,  the  toxicity 
of  those  media,  and  the  biotic  communities  inhabiting  those  media.  However,  weighing 
of  evidence  is  needed  even  when  predictive  assessments  of  new  chemicals  are  performed. 
For  new  chemicals,  evidence  to  be  weighed  includes  results  of  various  toxicity  tests,  physi¬ 
cal-chemical  properties  of  the  chemical,  statistical  and  mathematical  models,  and  effects 
of  analogous  chemicals  that  have  been  released  in  the  past. 

As  far  as  possible,  the  weighing  of  evidence  should  be  performed  by  an  a  priori  logic,  not 
an  ad  hoc  on^.  The  best  known  example  of  such  a  logical  structure  is  Chapman  s  (1990) 
“sediment  quality  triad”  (Table  1-1).  The  principal  limitation  of  that  method  is  that  it 
does  not  explicitly  incorporate  variance  in  the  quality  of  the  lines  of  evidence.  For  ex¬ 
ample,  if  chemicals  are  not  measured  at  toxic  concentrations  and  toxicity  tests  are  nega¬ 
tive  but  the  community  is  altered  (Table  1-1,  line  5),  it  may  be  that  the  alteration  is  not 
due  to  toxic  chemicals  or  it  may  be  that  both  the  analytical  methods  and  the  toxicity  tests 
are  not  sufficiently  sensitive. 

1.8  Phasing  of  the  assessment 

Phasing  is  a  desirable  feature  of  ERAs,  but  unlike  hazard  assessments,  ERAs  do  not  re¬ 
quire  it.  ERAs  use  whatever  data  are  available  and  present  the  risk  manager  with  the 
choice  of  making  a  decision  with  the  current  level  of  uncertainty  or  of  reducing  uncer¬ 
tainty  by  performing  more  studies.  In  general,  it  is  advisable  to  gather  some  data  and 
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Table  1-1  Inference  based  on  the  "sediment  quality  triad” 


Situation 

Chemicals 

present 

Toxicity 

Community 

alteration 

Possible  conclusions 

1 

+ 

-1- 

+ 

Strong  evidence  for  pollution-induced 
degradation 

2 

- 

- 

- 

Strong  evidence  that  there  is  no  pollution - 
induced  degradation 

3 

+ 

- 

- 

Contaminants  are  not  bioavailable,  or  are 
present  at  nontoxic  levels 

4 

- 

+ 

- 

Unmeasured  chemicals  or  conditions 
exist,  with  potential  to  cause  degradation 

5 

- 

- 

+ 

Alteration  is  not  due  to  toxic  chemicals 

6 

+ 

-1- 

— 

Toxic  chemicals  are  stressing  the  system 
but  are  not  sufficient  to  significantly 
modify  the  community 

7 

- 

+ 

+ 

Unmeasured  toxic  chemicals  are  causing 
degradation 

8 

+ 

- 

+ 

Chemicals  are  not  bioavailable,  or 
alteration  is  not  due  to  toxic  chemicals 

Source:  Chapman  1990. 

Responses  are  shown  as  either  positive  (+)  or  negative  (-),  indicating  whether  or  not  measurable  (e.g.,  statistically 
significant)  differences  from  control/reference  conditions  are  determined. 


perform  a  preliminary  assessment  in  order  to  properly  perform  the  problem  formulation 
phase  of  the  definitive  ERA.  That  assessment  is  commonly  termed  a  screening  assess¬ 
ment  because  it  screens  out  certain  chemicals,  media,  receptors,  and  areas  from  further 
consideration. 

Care  must  be  taken  in  the  design  of  phased  studies.  The  logic  that  has  been  used  in  phas¬ 
ing  assessments  of  new  chemicals  (start  with  quick  but  insensitive  tests)  does  not  work 
for  other  sorts  of  assessments.  If  we  collect  sediments  from  a  contaminated  site  and  per¬ 
form  acute  lethality  tests,  we  have  no  appropriate  response  to  negative  results.  Unless  we 
are  unconcerned  about  sublethal  effects  or  effects  of  extended  exposures,  negative  results 
do  not  allow  us  to  either  declare  the  sediment  acceptable  or  identify  an  effect. 
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Integration  of  risk  assessment 
and  risk  management 

Anthony  F.  Maciorowski 

Scientists  and  risk  managers  must  interact  and  communicate  in  the  development  and 
interpretation  of  risk  assessments  and  their  final  applications.  However,  they  must  also 
learn  to  respect  each  other  s  spheres  of  expertise,  influence,  and  control  in  the  overall 
decision-making  process.  Integrated  decision-making  is  a  recent  and  rapidly  evolving 
process.  As  such,  the  respective  roles,  responsibilities,  information  needs,  and  process 
boundary  points  are  neither  well  defined  nor  well  understood.  As  an  example,  the  Frame¬ 
work  for  Ecological  Risk  Assessment  (USEPA 1992)  clearly  emphasizes  the  need  for  dis¬ 
cussions  between  risk  managers  and  risk  assessors  at  the  planning  stage,  at  key  decision 
points,  and  on  completion  of  the  risk  characterization.  However,  the  framework  provides 
little  specific  information  concerning  communication  and  process  issues  at  the  risk 
assessor-risk  manager  interface  or  concerning  how  to  incorporate  risk  mitigation  inter¬ 
actions  into  the  original  risk  assessment. 

Improved  understanding  of  the  different  perspectives  of  risk  assessors  and  risk  manag¬ 
ers  is  crucial  to  the  ultimate  success  of  integrated  decision-making  processes.  Risk  asses¬ 
sors  are  often  concerned  with  performing  risk  assessments  in  the  most  scientifically 
credible  manner  and  identifying  additional  data  or  research  to  better  characterize  risk. 
Risk  managers,  on  the  other  hand,  may  be  more  interested  in  integrating  ecological  risk 
conclusions  in  a  broader  risk  or  risk-benefit  framework  to  finalize  decisions  with  the 
information  in  hand.  This  may  include  opting  to  impose  risk  reduction  or  risk  mitigation 
control  practices  rather  than  undergoing  successive  iterations  of  the  original  risk  assess¬ 
ment. 

Risk  reduction  or  risk  mitigation  activities  are  defined  as  actions  to  reduce  or  eliminate 
adverse  human  health  or  environmental  effects  and  are  becoming  increasingly  important 
risk  management  tools.  Underlying  policy  implications  behind  risk  reduction  and  inte¬ 
grated  decision-making  are  detailed  in  the  strategic  initiatives  and  guiding  principles 
recently  released  by  USEPA  (1994),  including  ecosystem  protection,  pollution  preven¬ 
tion,  strong  science  and  data,  partnerships,  and  environmental  accountability.  In  es¬ 
sence,  the  emerging  policies  are  directed  toward  greater  participation  in  environmental 
problem-solving.  This  process  is  intended  to  engage  greater  participation  in  decision¬ 
making,  including  parties  affected  by  the  decision  (the  regulated  community,  user 
groups,  environmental  interest  groups,  and  the  general  public,  as  well  as  scientists.) 
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The  point  to  underscore  is  that  while  ERA  is  advancing  as  a  science,  so  too  are  risk  man¬ 
agement  environmental  policy  goals  and  initiatives  that  include  greater  public  participa¬ 
tion.  Under  these  circumstances,  it  is  imperative  that  risk  assessors  and  risk  managers 
better  understand  each  other  and  their  respective  processes.  Fortunately,  there  are  some 
precedents  for  this  understanding.  Risk  assessment  is  largely  a  scientific  process  that 
must  retain  elements  of  scientific  independence  and  rigor  to  ensure  that  risk  character¬ 
izations  provide  an  objective  evaluation  of  the  available  data  and  information.  There  is 
longstanding  precedent  for  separating  risk  assessment  and  risk  management  to  ensure 
that  scientific  integrity  is  maintained  in  decision-making  (NRC  1983, 1993;  Thomas 
1987,  USEPA 1992).  However,  separation  does  not  imply  an  absence  of  communication 
or  a  failure  to  understand  each  other  s  processes  and  needs. 

Risk  managers  are  the  ultimate  users  of  risk  assessments,  which  are  often  subject  to  con¬ 
straints  within  existing  legal,  policy,  and  economic  realities.  As  such,  risk  assessors  must 
be  aware  of  risk  management  needs  in  the  problem  formulation  stage,  to  ensure  that  as¬ 
sessment  endpoints  and  resolving  power  that  the  decision-maker  requires  are  under¬ 
stood.  Once  agreement  on  assessment  endpoints  necessary  to  decision-making  is 
reached,  it  is  within  the  realm  of  the  risk  assessor  to  select  the  measurement  endpoints 
necessary  to  achieve  the  goals  and  objectives  dictated  by  the  assessment  endpoints.  Ide¬ 
ally,  this  would  be  agreed  to  during  a  formal  a  priori  problem-formulation  step  in  the  risk 
assessment  process.  Unfortunately,  routine  problem  formulation  that  engages  both  risk 
assessors  and  risk  managers  has  not  been  a  common  practice.  Indeed,  much  conflict  and 
controversy  surrounding  the  application  of  risk  assessments  can  often  be  traced  to  the 
lack  of  a  formal  problem-formulation  step  in  the  decision  process.  As  such,  the  impor¬ 
tance  of  promoting  formal  problem  formulation  cannot  be  overstated. 

Once  a  risk  characterization  is  passed  to  a  risk  manager,  additional  risk  assessment-risk 
management  discussion  is  necessary.  Presented  with  a  scientific  evaluation  of  risk,  the 
risk  manager  may  want  additional  information  or  study,  or  may  need  to  act  on  the  infor¬ 
mation  in  hand  regardless  of  its  scientific  strengths  or  shortcomings.  Rather  than  refine 
the  risk  assessment,  the  risk  manager  may  opt  to  impose  mitigation  to  reduce  the  risk, 
even  in  the  face  of  uncertainty  that  the  mitigation  will  be  effective.  When  such  situations 
occur,  risk  assessors  must  clearly  and  succinctly  summarize  both  risk  and  mitigation 
options  for  the  benefit  of  risk  managers,  stakeholders  in  the  decision,  and  the  public  at 
large.  Further,  risk  assessors  must  be  willing  to  discuss  the  relative  merits  of  risk  mitiga¬ 
tion  even  in  the  absence  of  data. 

Although  there  is  general  agreement  that  risk  assessors  need  to  be  involved  in  risk  man¬ 
agement  decisions,  it  is  also  important  to  ensure  the  scientific  integrity  of  the  risk  assess¬ 
ment  process.  Once  a  risk  characterization  is  used  to  reach  a  decision,  the  risk  assessor 
rarely  has  an  opportunity  to  request  more  data  or  information  on  which  to  base  opinions 
or  recommendations.  More  importantly,  the  risk  assessor  has  now  moved  into  the  risk 
management  arena.  In  the  risk  management  decision  process,  the  risk  assessor  may  be 
asked  to  analyze  or  judge  the  effect  of  the  proposed  risk  mitigation  on  the  original  risk 
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assessment.  This  analysis  does  not  change  the  original  risk  assessment  but  begins  to  ana¬ 
lyze  whether  management  actions  such  as  mitigation  will  reduce  risk  to  acceptable  levels. 

Until  the  overall  integrated  decision-making  process  is  better  defined  and  understood  by 
both  risk  assessors  and  risk  managers,  there  will  undoubtedly  be  some  conflict  and  mis¬ 
understanding.  However,  recognizing  and  understanding  that  risk  assessors  and  risk 
managers  have  different  roles  and  responsibilities  should  go  a  long  way  toward  improv¬ 
ing  the  decision  process. 

This  paper  was  originally  a  contribution  to  the  Product  Safety  Assessment  Workgroup 
(Chapter  4);  the  editors  judged  the  issue  of  risk  assessment-risk  management  integration 
to  be  a  good  introduction  to  general  sediment  risk  assessment  issues. 
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Assessing  ecological  risks  to  benthic  species 
in  product  and  technology  development 

Charles  A.  Pittinger,  William  J.  Adams 

3.1  Introduction 

Those  involved  in  technology  development  in  both  the  private  and  public  sectors  share 
responsibility  for  ensuring  its  safety  in  all  relevant  environmental  compartments.  Tech¬ 
nological  development  of  new  products,  ingredients,  industrial  processes,  and  emissions 
requires  a  systematic  evaluation  of  potential  adverse  effects  to  freshwater  and  marine, 
pelagic,  and  benthic  communities.  Ecological  risk  assessment  as  originally  described  in 
the  first  SETAC  Pellston  Workshop  (Cairns  et  al  1978)  and  refined  by  the  USEPA  (1992) 
provides  a  sound  and  quantitative  framework  for  these  evaluations.  Consideration  of 
risks  pertaining  to  benthic  communities  and  the  sediments  they  occupy  is  an  integral 
element  in  the  broader  process  of  ERA. 

Recent  scientific,  engineering,  and  marketing  developments  have  greatly  improved  tech¬ 
nical  capabilities  and  available  resources  for  testing  and  research  and  for  performing 
ERA.  These  developments  have  enabled  companies  and  governmental  institutions  to 
better  evaluate  and  manage  sediment-related  risks  of  new  technologies. 

Major  developments  have  occurred  in  the  areas  of  environmental  fate  and  ecotoxico- 
logical  evaluation: 

•  Increased  understanding  of  natural  sorption  processes  and  maj  or  environmen¬ 
tal  parameters  that  affect  sorption  and  chemical  bioavailability 

•  Development  of  sediment  sorption  theories  for  nonionic  organics,  metals,  and 
some  polar  organics 

•  Development  of  sophisticated  models  for  estimating  sediment  partitioning, 
quantitative  structure-activity  relationships  (QSARs),  and  the  expansion  of 
personal  computing  technology  enabling  easier  access  and  broader  use  of  these 
models 

•  Increased  understanding  of  the  routes  of  exposure  to  benthic  organisms 
through  pore  water  and  sediment  ingestion,  and  better  extrapolations  among 
sensitivities  of  pelagic  and  benthic  organisms  to  chemicals 

•  Improved  techniques  for  estimating  toxicity  to  benthic  organisms,  including  the 
development  and  standardization  of  sensitive,  acute,  and  chronic  sediment 
toxicity  tests  for  a  broad  range  of  freshwater  and  marine  organisms 

•  Increased  availability  of  competent  contract  laboratories  capable  of  conducting 
sediment  toxicity  tests 
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•  Development  of  more  reliable  procedures  for  sampling,  handling,  and  storing 
sediment  samples 

While  considerable  progress  has  been  made  in  understanding  sediment  sorption  pro¬ 
cesses  as  they  relate  to  bioavailability,  more  work  remains  to  be  done  if  practical,  routine, 
and  protective  test  methods  and  criteria  for  screening  technological  impacts  to  benthic 
communities  are  to  be  developed.  This  chapter  provides  an  overview  of  how  sediment 
evaluations,  as  an  element  of  the  ERA  process,  can  successfully  be  integrated  into  technol¬ 
ogy  development.  A  fundamental  approach  for  screening  new  chemical  technologies  is 
presented  with  a  description  of  typical  sediment-related  testing  methodologies.  Some 
policy  considerations  pertaining  to  the  management  of  sediment-related  risks  are  also 
presented. 

3.2  Integrating  risk  assessment  with  technology 
development  needs 

Ultimate  responsibility  for  ensuring  the  ecological  safety  of  a  new  commercial  technol¬ 
ogy,  such  as  a  new  product  or  product  ingredient,  lies  with  the  producer  as  well  as  with 
the  users  or  consumers  of  that  technology.  The  distinction  between  ERA  and  risk  man¬ 
agement  (USEPA 1992)  is  entirely  appropriate  for  sediment-related  assessments  of  new 
technologies.  As  with  all  ERAs,  early  communication  by  the  risk  managers  to  those  in¬ 
volved  in  assessing  potential  technological  risks  to  benthic  communities  is  essential  (see 
Chapter  2).  The  risk  manager  should  convey  the  intended  use  of  the  assessment  (includ¬ 
ing  potential  regulatory  submissions),  the  resources  available  and  deemed  appropriate, 
and  the  critical  timelines,  depending  upon  other  steps  involved  in  technology  develop¬ 
ment  (Moore  and  Biddinger  1995).  These  interactions  help  to  ensure  that  the  ultimate 
risk  characterizations  are  relevant,  timely,  and  cost-effective.  These  properties  are  critical 
to  institutions  that  must  apply  risk  assessments  to  support  environmental  management 
and  decision-making  as  well  as  to  comply  with  regulatory  requirements  (White  et  al 
1995). 

Though  ERA  is  sometimes  viewed  as  predominantly  a  function  of  regulatory  agencies, 
the  same  assessment  approach  is  often  applied  by  industry  as  an  internal  technology 
development  tool.  Innovative  companies  rely  upon  ERA  to  identify  real  or  potential  prob¬ 
lems  associated  with  new  products  and  new  technologies,  recognizing  that  environmen¬ 
tal  factors  must  be  considered  in  conjunction  with  conventional  marketing  and 
manufacturing  factors.  For  this  reason,  many  companies  have  developed  and  imple¬ 
mented  environmental  policies  to  address  ecological  risks  throughout  the  development, 
manufacturing,  distribution,  and  marketing  process. 

Effective  implementation  of  these  environmental  policies  requires  the  following: 

•  A  high-level  commitment  by  the  company  or  institution  to  environmental 
quality,  involving  full  ownership  and  “buy-in”  at  the  company’s  highest  manage¬ 
ment  level 
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•  Access  to  advanced  testing  capabilities  in  environmental  toxicology,  microbiol¬ 
ogy,  and  chemistry.  Screening-level  tests  that  can  reliably  predict  vulnerabilities 
in  a  compound  s  safety  profile  are  essential  to  keep  pace  with  myriad  other 
variables  and  demands  in  bringing  a  new  technology  to  market.  Prediction  of 
biodegradability,  sorptivity,  and  bioaccumulation  potential  are  additional 
important  screening-level  testing  areas  that  can  identify  potential  sediment 
issues. 

•  An  effective  technology  development  process  that  incorporates  early  environ¬ 
mental  screening  and  assessment.  Companies  must  ensure  an  effective  manage¬ 
ment  system  that  incorporates  environmental  and  human  risk  assessment  into 
technology  development,  process  development,  product  development,  and 
manufacturing  processes.  Integration  at  an  early  stage  is  preferred,  as  competi¬ 
tive  market  forces  dictate  ambitious  and  precisely  controlled  timelines. 

•  Effective  communications  and  information  feedback  systems  across  product 
development,  process  development,  manufacturing,  and  marketing  operations. 
Communications  between  or  among  companies  engaged  in  customer-supplier 
relationships  are  necessary  to  ensure  comprehensive  understanding  and 
management  of  environmental  safety  across  the  overall  life  cycle  of  the  technol¬ 
ogy.  Corporations  involved  in  customer-supplier  relationships  frequently 
exchange  safety  information  and  cooperate  in  the  conduct  of  ERAs.  Companies 
supplying  ingredients  to  consumer  product  manufacturers  often  work  in 
tandem  to  generate  ecological  fate-and-effects  data  and  to  conduct  joint  risk 
assessments. 

3.3  Predicting  sediment  risk  through  early  screening  and 
chemical  property  evaluation 

Predictive  and  efficient  screening  of  chemicals  for  sediment-related  fate  and  effects  is 
indispensable  in  technology  development  because  early  product  development  efforts 
often  generate  large  numbers  of  candidate  substances.  Early  recognition  of  sediment 
partitioning  potential  is  necessary  to  plan  an  appropriate  risk  assessment  strategy  before 
major  commercial  decisions  and  investments  are  made.  Indiscriminate  benthic  testing  of 
all  candidate  substances  early  in  technology  development  is  neither  economically  feasible 
nor  necessary  to  ensure  safety.  Screening-level  assessments  using  physical-chemical  pa¬ 
rameters,  fundamental  fate  and  toxicity  trends  extrapolated  from  pelagic  testing,  and  the 
use  of  conservative  assumptions  and  assessment  factors  can  effectively  be  used  to  evalu¬ 
ate  safety  to  benthic  species  and  sediment  processes  early  in  the  technology  development 
process. 

Initial  predictions  of  sediment  and  porewater  concentrations  can  often  be  obtained  from 
octanol-water  partition  coefficients  and  structure-activity  relationships  (Cowan  et  al 
1995).  The  prediction  is  typically  conservative  and  assumes  no  burial  or  transformation 
{e.g.,  biodegradation,  hydrolysis)  to  reduce  concentrations  and  no  complexation  to  re¬ 
duce  bioavailability.  For  example,  screening-level  sediment  risk  assessments  may  be  con¬ 
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ducted  by  comparing  porewater  concentration  estimates  with  toxicity  data  for  pelagic 
test  organisms,  typically  fish  or  daphnids,  by  assuming  similar  sensitivities  and  routes  of 
exposure  as  benthic  organisms.  Screening-level  assessments  based  on  comparisons  with 
pelagic  species,  however,  cannot  simulate  biotic  and  abiotic  sediment  transformation 
and  redistribution  processes  that  may  alter  exposure  concentrations. 

Two  chemical  properties  useful  in  screening  potential  exposures  to  sediment-associated 
contaminants  are  persistence  and  sorptivity.  In  general,  chemicals  most  likely  to  occur  in 
appreciable  concentrations  in  sediments  exhibit  relatively  long  persistence  and  high 
sorptivity.  Highly  sorptive  but  readily  degradable  compounds,  under  typical  conditions, 
are  usually  degraded  before  reaching  toxic  thresholds  in  sediments  or  fauna  (Versteeg 
and  Shorter  1992).  Recalcitrant  but  nonsorptive  compounds  are  typically  evaluated  in 
ERAS  aimed  at  pelagic  organisms  in  surface  waters.  The  potential  for  a  substance  to  ac¬ 
cumulate  in  a  particular  environmental  compartment  can  be  estimated  by  comparisons 
of  the  substance  s  biological  half-life  with  its  residence  time  in  that  compartment  (Larson 
and  Cowan  1995). 

While  chemicals  that  readily  mineralize  have  less  potential  to  accumulate  in  sediments 
(Fendinger  et  ah  1994),  persistent  chemicals  require  broader  and  more  intensive  evalua¬ 
tion  in  all  environmental  compartments,  including  sediments.  Sorptive  chemicals  typi¬ 
cally  accumulate  in  environmental  compartments  with  longer  residence  times,  e.g., 
sediments.  Given  the  longer  residence  in  these  compartments,  biodegradation  proceed¬ 
ing  at  rates  slower  than  in  surface  waters  can  still  be  a  practical  removal  mechanism  for 
commercial  chemicals  (Ventullo  and  Larson  1994). 

Screening-level  tests  that  measure  the  biological  half-life  of  a  compound  under  aerobic 
conditions  can  be  cost-effective  and  are  often  conducted  at  an  early  stage  in  technology 
development.  Results  of  biodegradability  tests  simulating  aerobic  conditions  in  surface 
waters  {e.g,  American  Society  for  Testing  and  Materials  [ASTM]  1995a)  frequently  can 
be  used  to  help  predict  chemical  fate  in  sediments.  Anaerobic  biodegradability  testing 
requires  specialized  skills  and  laboratory  facilities  and  is  not  usually  conducted  in  the 
course  of  initial  screening. 

However,  screening-level  tests  for  anaerobic  biodegradability  are  available  (European 
Centre  for  Ecotoxicology  and  Toxicology  of  Chemicals  [ECETOC]  1988).  Field  investiga¬ 
tions  of  biodegradability  (Shimp  and  Schwab  1991;  Federle  and  Schwab  1992)  are  usually 
conducted  to  confirm  results  of  laboratory-based  predictions  for  commercially  impor¬ 
tant  or  large-volume  product  ingredients. 

Potential  risks  of  food-chain  transfer  and  ecological  (or  human  health)  effects  from 
chemicals  that  bioaccumulate  in  benthic  organisms  can  also  be  evaluated  early  in  screen¬ 
ing  new  chemical  technologies.  While  bioaccumulation  concerns  are  not  always  specific 
to  chemicals  that  partition  in  sediment,  the  chemicals  that  partition  are  often  persistent 
and  may  bioaccumulate.  Structure-activity  relationships  for  predicting  bioaccumulation 
potential  have  been  described  for  a  large  number  of  chemicals  (Lyman  et  al  1982).  For 
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this  reason  and  for  cost  reasons,  laboratory  bioaccumulation  tests  are  typically  con¬ 
ducted  in  the  course  of  higher-level  ERAs  and  are  reserved  for  the  most  promising  chemi¬ 
cal  technologies.  Statutory  requirements  (Table  3-1)  may  also  trigger  empirical 
bioaccumulation  testing,  particularly  for  certain  technology  sectors  (e.g.,  agricultural 
chemical  production). 

3.4  Empirical  approaches  for  chemical  technology  evaluation 

More  definitive  exposure  estimation  techniques  and  direct  testing  of  benthic  organism 
sensitivity  may  be  necessary  when  safety  questions  arise  upon  initial  screening  or  when 
regulatory  requirements  dictate  empirical  testing.  To  conserve  costs  and  to  expedite  tech¬ 
nology  development,  toxicity  testing  of  benthic  organisms  is  usually  reserved  for  materi¬ 
als  that  have  a  demonstrated  commercial  value  and  a  potential  to  accumulate  in 
sediments  (Pittingereffl/.  1989). 

Definitive  exposure  testing,  unless  required  by  law,  is  similarly  reserved  for  “mature”  or 
highly  commercialized  technology  markets  in  order  to  verify  or  validate  model  predic¬ 
tions.  These  often  employ  sediment-water  systems  that  vary  in  size  from  small  aquaria  to 
experimental  streams,  in  which  material  concentrations  can  be  directly  measured  in  pore 
water,  sediment  solids,  whole  sediment,  and  overlying  water  (Pittinger  etal  1988).  With 
this  information,  better  mechanistic  understanding  of  sediment  and  porewater  expo¬ 
sures  and  first-order  biodegradation  properties  can  be  gained. 

A  number  of  sediment  toxicity  tests  have  been  developed  and  standardized  (Table  3-2) 
and  are  available  through  contract  laboratories.  Toxicity  testing  of  new  materials  with 
benthic  organisms  is  conceptually  similar  to  testing  pelagic  organisms;  however,  it  is  of¬ 
ten  considerably  more  involved  (and  expensive)  due  to  1)  the  potential  for  multiple 
routes  of  exposure  to  the  organism  (i.e.,  respiration  of  pore  water,  ingestion  of  sediment 
particles,  and  direct  epidermal  contact)  and  2)  complex  partitioning  and  accelerated 
transformation  processes  that  may  greatly  alter  bioavailability  and  introduce  a  plethora 
of  secondary  compounds.  For  this  reason,  direct  benthic  toxicity  tests  ideally  should  be 
designed  to  account  for  transformation  processes  and  ecological  and  life  history  charac¬ 
teristics  of  the  organisms  most  likely  to  be  exposed. 

Field  monitoring  of  concentrations  of  commercial  chemicals  in  sediments  can  provide 
the  most  realistic  estimation  of  direct  exposure  to  benthic  organisms  but  typically  is  cost¬ 
intensive,  time-intensive,  and  retroactive  in  nature.  As  such,  monitoring  is  normally  con¬ 
ducted  for  higher-tiered  risk  assessments.  Monitoring  is  often  constrained  by  the  avail¬ 
ability  of  sensitive  and  specific  analytical  methods.  The  reliability  of  monitoring  data  is 
usually  determined  by  selection  of  representative  sites,  by  sampling  and  storage  proce¬ 
dures,  and  by  the  use  of  accurate  and  precise  analytical  methods  (ASTM  1995b). 
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Table  3-1  Summary  of  U.S.  and  European  toxicity  test  requirements  by  regulatory  requirement 


Regulation  or  statute 

Type  of  testing  required 

•  Clean  Water  Act  (CWA) 

Aquatic  tests  for  the  protection  of  surface  waters 

Water  Quality  Standards 

Aquatic  tests  for  the  development  of  water  quality  criteria 
(WQC) 

Sediment  Quality  Criteria  (SQC) 

No  aquatic  testing  required  (use  WQC  and 

USEPA  National  Pollutant  Discharge 
Elimination  System  (NPDES) 
Regulations 

No  tests  required  at  this  time 

•  Toxic  Substances  Control  Act  (TSCA) 

Industrial  and  specialty  chemicals:  aquatic  assessments 

Premanufacture  Notification  (PMN) 

Sediment  testing  usually  not  required.  Tests  can  be 
required  for  high  Kqw  chemicals 

Section  Four  Test  Rule 

Sediment  tests  with  Chironomm  tentans  have  been  used 

Aquatic  Test  Guideline  Number; 

795.12 

Hyaklla  azteca  flow-through  acute  test 

797.131 

Gammarid  acute  test  {Gammanis  sp .) 

797.193 

Mysid  shrimp  acute  test 

797.195 

Mysid  shrimp  chronic  test 

797.197 

Penaeid  shrimp  acute  test 

Adams  etal.  (1985) 

Midge  partial  life-cycle  test 

•  Federal  Insecticide,  Rodenticide  and 
Fungicide  Act  (FIFRA) 

Pesticide  registration:  aquatic  assessments 

Subdivision  E  -  Wildlife  and  Aquatic 
Organisms 

Simulated  or  actual  field  tests  for  aquatic  organisms 
(mesocosms  or  outdoor  microcosms) 

•  Food,  Drug  and  Cosmetics  Act 

Environmental  assessments  for  new  food,  drug,  and 
cosmetic  products 

Section  4.10 

Hyaklla  azteca  acute  toxicity  test  has  been  required 

•  Organization  of  Economic  Cooperation 
and  Development  (OECD)  and  European 
Economic  Community 

European  Community  aquatic  testing  requirements 

Sediment  testing  protocols  under  development 

•  Paris  Commission  (PARCOM) 

European  Community:  Paris  Commission 

Offshore  chemical 
notification/evaluation 

Sediment  reworker  test  ( Corophium  volutator,  Nereis  virens, 
mdAbraalba) 

SETAC  Press 


Session  2:  Product  Safety  Assessment,  Chapter  3  1 7 


Table  3-2  Summary  of  ASTM  standard  procedures  for  conducting  aquatic  sediment  toxicity  tests 


Test  description 

Reference  ^ 

Guide  for  conducting  static  acute  toxicity  tests  starting  with  embryos  of  four 
species  of  saltwater  bivalve  molluscs^ 

ASTM  E  724-94 

Practice  for  conducting  bioconcentration  tests  with  fishes  and  saltwater  bivalve 
molluscs^ 

ASTM  E 1022-94 

Guide  for  conducting  life-cycle  toxicity  tests  with  saltwater  mysids^ 

ASTM  E 1191-90 

Guide  for  conducting  renewal  life-cycle  toxicity  tests  with  Daphnia  magna^ 

ASTM  E 1193-94 

Guide  for  conducting  three-brood,  renewal  toxicity  tests  with 

Ceriodaphnia  dubia  ^ 

ASTM  E 1295-89 
(Reapproved  1995) 

Guide  for  conducting  10-day  static  sediment  toxicity  tests  with  marine  and 
estuarine  amphipods 

ASTM  E 1367-92 

Guide  for  collection,  storage,  characterization,  and  manipulation  of  sediments 
for  toxicological  testing 

ASTM  E 1391-94 

Guide  for  conducting  the  frog  embryo  teratogenesis  assay-xenopus  (fetax)^ 

ASTM  E 1439-91 

Guide  for  conducting  static  and  flow-through  acute  toxicity  tests  with 

Mysids  from  the  west  coast  of  the  United  States^ 

ASTM  E 1463-92 

Designing  biological  tests  with  sediments 

ASTME1525-94a 

Conducting  sediment  toxicity  tests  with  marine  and  estuarine  polychaetous 
annelids 

ASTM  E 1611-94 

Guide  for  determining  bioaccumulation  of  sediment-associated  contaminants 
by  benthic  invertebrates 

ASTM  E 1688-95 

Test  method  for  measuring  toxicity  of  sediment-associated  contaminants  with 
freshwater  invertebrates 

ASTM  E 1706-95 

1  Source:  ASTM  1995c. 

^The  aforementioned  test  is  not  specific  to  sediments,  but  the  methodology  has  frequently  been  modified  to  allow  the 
test  species  to  be  tested  with  whole  sediments  or  with  sediment  pore  water. 


3.5  Molecular  design  to  alleviate  potential  sediment-related 
concerns 

Early  technology  development  efforts  in  designing  new  chemicals  and  commercial  prod¬ 
ucts  are  increasingly  incorporating  environmental  considerations  to  minimize  the  poten¬ 
tial  for  ecological  risk.  These  environmental  criteria  must  be  balanced  with  important 
commercial  criteria  such  as  efficacy  and  stability  of  the  ingredient  in  the  product  matrix, 
manufacturing  cost  and  logistical  feasibility  in  manufacturing  or  formulating,  human 
safety  concerns,  etc.  Balancing  these  and  other  considerations  in  technology  development 
is  extremely  challenging,  requiring  close  coordination  among  technical  professionals 
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involved  in  product  and  process  development,  manufacturing,  and  marketing,  as  well  as 
in  the  life  sciences  (White  et  al  1995). 

The  development  of  readily  biodegradable  substances  for  commercial  applications  is 
perhaps  the  surest  approach  to  minimizing  risks  to  benthic  communities.  Recent  ad¬ 
vances  in  understanding  mechanisms  of  degradation  have  given  rise  to  ‘  weak-link  chem¬ 
istry.”  The  incorporation  of  readily  hydrolyzable  bonds  at  key  structural  locations  within 
a  molecule  may  ensure  complete  and  ready  degradation  through  pathways  involving  la¬ 
bile  intermediates.  For  example,  inclusion  of  two  weak  ester  linkages  in  the  structure  of 
a  cationic  surfactant  enabled  the  compound  to  be  rapidly  biodegraded  in  standard  labo¬ 
ratory  tests  and  a  range  of  environmental  compartments  (Giolando  etal  1995).  Develop¬ 
ing  organic  chemicals  with  less  highly  substituted  {e.g.,  tertiary  carbon  atoms)  moieties 
can  also  facilitate  enzyme-mediated  decomposition  reactions. 

To  a  lesser  extent,  sorptivity  or  hydrophobicity  can  be  manipulated  for  certain  chemical 
classes  and  technological  applications.  Sorptivity,  or  the  tendency  for  a  substance  to 
“stick”  to  surfaces,  may  however  be  integral  to  its  function  in  a  commercial  product  ma¬ 
trix.  One  example  is  detergent  softening  agents,  which  by  nature  must  cling  to  fabric 
surfaces  in  order  to  deliver  the  desired  softening  benefit  (Giolando  et  al  1995).  For  prod- 
uct  applications  requiring  the  use  of  sorptive  materials,  practical  biodegradability  is  the 
key  to  satisfying  product  development  needs  and  ensuring  environmental  compatibility. 

3.6  Science  policy  considerations  in  assessing  and  managing 
sediment  contaminants 

Important  science  policy  questions  pertaining  to  the  need  for  and  development  of  sedi¬ 
ment  quality  criteria  (SQC)  and  regulatory  standards  in  the  U.S.  for  registering  new 
chemical  technologies  and  for  managing  contaminated  sediments  have  been  (Adams  et 
al  1990)  and  are  being  broadly  debated,  even  as  SQC  are  being  prepared  for  publication 
by  the  USEPA.  Some  of  these  criteria  prompt  the  following  questions: 

•  Are  current  water  quality  criteria  (WQC)  adequate  to  prevent  sediment  con¬ 
tamination?  Are  sediments  allowed  to  be  legally  contaminated  to  toxic  levels 
today?  Is  another  set  of  national  criteria  and  standards  needed? 

•  Does  the  severity  of  contaminated  sediments  in  the  U.S.  justify  a  comprehensive 
national  sediment  management  strategy? 

•  Do  the  USEPA  s  Sediment  Quality  Criteria  and  the  National  Sediment  Manage¬ 
ment  Strategy  focus  on  historical  problems  that  may  not  be  readily  amenable  to 
resolution  through  future  management  policies? 

•  Are  costs  and  benefits  of  site  remediation  appropriately  factored  into  risk 
management  decisions  for  contaminated  sediments,  or  is  “cleanup  at  any  cost” 
the  de facto  federal  and  state  remediation  standard? 

Environmental  scientists,  particularly  those  in  an  applied  field  such  as  ecotoxicology, 
must  recognize  that  the  scientific  methods  and  questions  they  pursue  have  very  real  eco¬ 
nomic  and  social  implications.  Sound  application  of  scientific  methods  and  results  to 
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regulatory  decision-making  requires  a  number  of  elements,  including  well-validated 
methods  for  sediment  assessment;  continuing  and  open  dialogue  among  stakeholders 
involved  in  and  impacted  by  the  process;  identification  of  decision  points  to  reduce  un¬ 
necessary  testing  and  data  generation,  as  well  as  criteria  that  clearly  determine  when  the 
assessment  process  is  complete;  consideration  of  costs  and  benefits  of  instituting  further 
regulatory  requirements;  and  an  unbiased  peer  review  process  for  draft  regulations. 

3.7  Conclusions 

A  review  of  current  assessment  techniques  to  evaluate  the  safety  of  chemicals  in  the  en¬ 
vironment  indicates  that  scientific  understanding  and  tools  to  perform  aquatic  risk  as¬ 
sessments  have  significantly  improved  over  the  last  decade.  The  need  for  considering 
sediment  exposures  to  benthic  organisms  as  part  of  an  overall  aquatic  risk  assessment  for 
new  technologies  and  products  is  becoming  widely  understood  across  both  the  private 
and  public  sectors.  Techniques  exist  and  are  being  developed  which  can  be  used  for 
screening  and  evaluating  technologies  and  products  during  their  development.  This  al¬ 
lows  for  early  identification  of  potential  environmental  safety  concerns  and  informed 
decision-making  in  the  commercialization  process.  It  is  recognized  that  commercially 
valuable  chemicals  can  often  be  designed  to  be  more  biodegradable  and  less  sorptive  to 
sediments,  thereby  improving  their  environmental  compatibility.  In  looking  to  the  fu¬ 
ture,  it  will  be  important  to 

•  validate  the  methodologies  used  to  perform  sediment  risk  assessments, 

•  better  evaluate  uncertainties  of  the  risk  estimates, 

•  critically  evaluate  the  science  policies  underlying  the  regulatory  process  and 
their  cost/benefits  to  society,  and 

•  ensure  that  risk  management  decisions  based  upon  these  assessments  reflect 
sound  science. 
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Workgroup  summary  report  on  sediment 
risk  assessments  of  commercial  products 

Charles  A.  Pittinger,  William  J  Adams  Joseph  J  Dulka,  Rachel  Fleming, 
RichKimerle,  Patricia E.  King,  Thomas  W.  LaPoint,  Tony Maciorowski,  Gregory  Schiefer 

4.1  Introduction 

This  chapter  presents  an  approach  for  conducting  sediment  ecological  risk  assessments 
(SERAs)  of  new  and  existing  products.  Sediment  ERAs  are  usually  performed  as  part  of 
an  overall  risk  assessment  for  a  given  product,  as  opposed  to  an  independent  assessment 
of  the  sediment  compartment.  Products  are  defined  as  chemicals.  These  chemicals  form 
the  ingredients  for  consumer  product  formulations  and  for  industrial  and  agricultural 
products.  In  this  sense,  single  chemicals,  chemical  mixtures,  impurities,  and  intermedi¬ 
ate  compounds  formed  during  their  manufacture  or  formulation  are  relevant.  Excluded 
from  consideration  in  this  context  are  genetically  engineered  and  nonengineered  micro¬ 
organisms  or  nonchemical  commercialized  products  (e.g.,  devices,  fabrications,  debris, 
dredged  sediments). 

Sediment-specific  ERAs  of  products  can  be  either  prospective  or  retrospective  in  nature. 
Prospective  assessments  pertain  to  new  chemical  products  requiring  initial  safety  evalua¬ 
tion  prior  to  their  use  or  commercialization.  Often  these  assessments  involve  tests  with 
sediments  artificially  treated  with  the  product  or  its  ingredients  in  the  laboratory.  Retro¬ 
spective  assessments  focus  on  evaluations  of  existing  chemicals  following  some  period  of 
use  and  discharge  to  the  environment  and  typically  involve  the  testing  of  field-collected 
samples.  The  latter  type  of  assessment  may  be  prompted  by  the  need  for  additional  safety 
testing  in  light  of  new  information  or  the  development  of  new  methods,  or  for  monitor¬ 
ing  to  periodically  confirm  the  accuracy  of  exposure  estimates.  Given  the  logistical  simi¬ 
larities  of  retrospective  product  assessments  to  site  evaluations  (Chapter  7),  the  primary 
focus  of  this  chapter  will  be  prospective  assessments  of  new  chemical  products. 

Prospective  risk  assessments  of  new  chemical  products  are  conducted  by  both  the  private 
and  public  sectors  in  order  to  satisfy  corporate  safety  standards  or  to  comply  with  exist¬ 
ing  environmental  regulations.  To  date,  the  scientific  and  regulatory  processes  to  assess 
the  potential  risks  of  chemical  products  associated  with  sediments,  both  prospectively 
and  retrospectively,  have  been  limited.  Aquatic  ERAs  have  historically  focused  on  the 
protection  of  water  column  organisms  (fish  and  invertebrates).  Until  recently,  consider¬ 
ably  less  attention  has  been  paid  to  the  ecologically  relevant  benthic  community  occupy¬ 
ing  the  sediment  compartment.  The  key  assessment  endpoint  of  sediment  product  risk 
assessments  is  the  protection  of  benthic  species,  ecological  communities,  and  associated 
food  webs  leading  to  humans. 
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The  nature  and  complexity  of  the  sediment  environment  offers  unique  challenges  in  as¬ 
sessing  product  risk.  Chief  among  these  are  the  difficulty  in  measuring  or  estimating  the 
distribution  and  bioavailability  of  products  among  whole  sediment  and  pore  waters 
which  determine  exposure  of  benthic  organisms.  Effects  assessments  of  products  to 
benthic-dwelling  organisms  may  be  further  complicated  by  unique  habitat  requirements 
which  are  difficult  to  simulate  under  laboratory  conditions  and  by  an  absence  of  life  his¬ 
tory  and  physiological  information  for  many  benthic  species.  These  difficulties  have  lim¬ 
ited  the  amount  of  data  collected  for  sediment  assessments.  Therefore,  probabilistic 
estimations  of  risk  are  rare  and  are  largely  limited  to  products  in  long-term  use  (e.g.,  de¬ 
tergent  ingredients)  or  products  receiving  a  particularly  high  degree  of  scrutiny  for  initial 
registration  (e.g.,  pesticides). 

This  section  proposes  a  sediment-specific  approach  for  product  risk  assessment  in  order 
to  address  these  limitations  and  to  fulfill  a  need  for  clear  criteria,  or  “triggers,”  for  deter¬ 
mining  when  and  to  what  degree  formal  sediment  assessments  may  be  required.  A  ratio¬ 
nal  approach  to  problem  formulation  is  presented,  which  enables  public-  and 
private-sector  risk  assessors  and  risk  managers  to  determine  needs  for  explicit  and  tiered 
sediment  risk  assessments.  Such  an  approach  requires  the  definition  of  clear  assessment 
endpoints  and  reproducible  measurement  endpoints  that  meaningfully  relate  to  the  as¬ 
sessment  endpoints. 

The  incorporation  of  evaluation  points  during  problem  formulation  provides  the  flexibil¬ 
ity  needed  to  ensure  the  integrity  of  the  process  and  the  protection  of  the  environment, 
while  simultaneously  ensuring  that  this  evaluation  is  done  in  the  most  cost-effective  and 
efficient  manner.  These  evaluation  steps  also  determine  the  sufficiency  of  the  information 
assessed  at  each  stage,  thereby  ensuring  that  the  investigative  effort  and  resources  are 
proportional  to  the  product  s  environmental  risk.  Additionally,  a  straightforward  evalu¬ 
ation  approach  for  exposure  assessment  of  products  to  benthic  communities  is  pre¬ 
sented.  Finally,  key  research  questions  and  limitations  in  sediment-related  product  risk 
assessments  are  identified. 

4.2  Conceptual  framework  for  prospective  product 

Several  ERA  frameworks  were  reviewed  and  discussed  by  workshop  participants.  These 
included  the  Environment  Canada  (1996),  USEPA  (1992),  European  Commission  (EC 
1994),  and  Oak  Ridge  National  Laboratory  (Barnthouse  and  Suter  1986)  approaches.  It 
was  concluded  that  these  approaches  were  conceptually  similar  and  provided  a  useful 
model  to  develop  the  product  sediment  risk  assessment  approach. 

The  basic  components  of  an  approach  for  product  SERA  are  depicted  in  Figure  4-1.  The 
risk  assessment  approach  is  patterned  after  the  USEPA  Ecological  Risk  Assessment 
Framework  (USEPA  1992)  and  is  intended  to  provide  guidance  for  assessing  the  risk  of 
chemicals  sorbed  to  sediments.  The  approach  assumes  that  risk  assessment  is  conducted 
in  a  risk  management  framework  that  emphasizes  communication  and  interaction  be¬ 
tween  risk  assessors  and  decision-makers  (risk  managers).  The  approach  includes  several 
discrete  phases  including  problem  formulation,  exposure  and  effects  analysis,  risk  char- 
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Figure  4-1  Conceptual  framework  for  conducting  product  sediment  risk  assessment 


acterization,  and  risk  assessment-risk  management  discussions  leading  to  a  risk  manage¬ 
ment  decision. 

As  a  brief  overview,  problem  formulation  is  the  starting  point  for  the  product  sediment 
risk  assessment  and  begins  with  initial  planning  discussions  between  the  risk  assessor 
and  the  risk  manager  to  determine  whether  or  not  there  is  a  potential  sediment  risk  and 
a  reason  for  conducting  a  formalized  assessment.  This  determination  is  often  based  on 
scientific  judgment  with  a  minimum  of  data.  If  no  further  assessment  of  the  sediment 
environmental  compartment  is  deemed  necessary  because  there  is  judged  to  be  negli- 
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gible  risk  to  sediment  biota,  the  process  proceeds  to  a  risk  assessment-risk  management 
discussion  and  final  decision  (see  Section  4.3). 

If  a  potential  risk  is  identified,  or  if  insufficient  evidence  exists  to  conclude  there  is  a  low 
probability  of  risk,  a  formal  sediment  risk  assessment  is  performed  (see  Section  4.4).  In 
the  analysis  step,  all  necessary  exposure  and  ecological  effects  studies  are  conducted  and 
made  available  to  the  risk  assessor.  In  the  risk  characterization  step,  the  risk  assessor  uses 
the  appropriate  tools  to  perform  the  risk  assessment  by  integrating  exposure  and  effects 
data.  Finally,  the  risk  assessor  and  risk  manager  again  enter  into  discussions  to  determine 
whether  the  risk  assessment  is  adequate  and  relevant  to  the  impending  management 
decision.  Questions  addressed  by  the  risk  assessor  and  the  risk  manager  at  this  point 
include  these: 

•  Were  the  goals  of  the  risk  assessment  attained? 

•  Is  there  a  need  to  reformulate  the  problem? 

•  Is  there  a  need  to  collect  more  data  on  exposure  or  effects? 

•  Were  the  uncertainties  adequately  expressed? 

•  Was  the  choice  of  how  the  risk  characterization  was  performed  correct  and 
useful? 

•  What  is  the  recommendation  to  the  risk  manager  on  the  nature  and  magnitude 
of  the  risk  in  relation  to  the  agreed-upon  assessment  endpoint? 

4.3  Problem  formulation 

The  evaluation  of  products  manufactured  and  disposed  or  discharged  into  the  environ¬ 
ment  contains  elements  of  both  science  and  policy  and  requires  close  collaboration  be¬ 
tween  the  risk  assessor  and  risk  manager.  Problem  formulation  is  intended  to  frame  the 
risk  management  decision  in  terms  appropriate  for  hypothesis  testing.  It  involves  consid¬ 
eration  of  potential  risks  with  the  product’s  manufacture,  use,  distribution,  or  disposal 
and  of  what  amount  of  data  might  be  required  to  assess  that  problem  (Figure  4-1).  There 
are  certain  situations  and  certain  chemicals  (see  Table  4-1)  for  which  no  complex  risk 
assessment  may  be  required.  However,  all  products  require  some  consideration  of  their 
potential  to  cause  adverse  effects  to  sediment  biota  as  well  as  other  environmental  com¬ 
partments  and  communities.  It  is  critical  that,  at  this  first  stage  of  the  ERA,  the  problem 
formulation  include  a  discussion  between  the  risk  assessor  and  risk  manager  of  available 
data  related  to  the  physical-chemical  properties  of  the  product  as  well  as  its  likely  use  and 
disposal.  Prior  to  the  collection  of  any  further  exposure  or  effects  characterization,  cer¬ 
tain  triggers  based  upon  the  available  data  (Table  4-1)  may  be  used  to  determine  if  further 
assessment  is  required.  These  triggers,  generally  based  upon  expert  opinion,  indicate  the 
likelihood  of  no  environmental  concerns  or,  conversely,  the  need  for  additional  testing 
and  evaluation.  They  primarily  relate  to  organic  chemicals  used  as  product  ingredients  or 
commodity  chemicals.  Such  triggers  may  include,  but  are  not  limited  to,  a  low  biocon¬ 
centration  factor  (BCF),  short-term  persistence  (days)  in  the  sediments  or  water,  low 
sorbtivity  to  sediments,  or  other  data/considerations  to  indicate  a  large  margin  of  safety. 


SET  AC  Press 


Session  2:  Product  Safety  Assessment,  Chapter4  27 


Table  4-1  Screening-level  triggers  to  determine  need  for  additional  sediment  risk  assessments 


Screening  triggers  for  ftirther  sediment  risk  assessment 

Risk  to  benthic  organisms 

More  assessment  needed 

No  significant  release  to  the  environment 

Significant  release  to  the  environment 

Low  persistence  ^  or  readily  degradable  (for 
organic  products  or  ingredients) 

Persistent;  not  readily  degradable 

Low  sorption  to  sediments  ^ 

Sorptive  to  sediments 

High  volatility^ 

Nonvolatile 

Nontoxic  to  aquatic  organisms^ 

Known  toxicity  or  mechanism  of  action 

Similarity  to  known  chemicals  of  low 
environmental  concern  ^ 

Similarity  to  known  chemicals  of  higher 
environmental  concern 

^  The  ratio  of  a  compound’s  biological  half-life  to  its  residence  time  in  an  environmental  compartment  has  been 
used  (Larson  and  Cowan  1995)  to  assess  persistence  of  continuous  releases  of  consumer  products. 

^  Log  K  ow  or  log  K  oc  >  3,0  has  been  used  by  the  USEPA  (TSCA)  to  indicate  a  need  for  ftirther  assessment. 

^  Heniy’s  Law  Constant  ^  10 

^  As  inferred  from  other  compartments  or  test  species,  ie. ,  water  column  tests. 

^  Use  of  structure-activity  relationships  and  best  professional  judgment  may  be  appropriate  for  some  products. 


In  addition  to  physical-chemical  characteristics  of  the  product,  there  are  a  number  of  key 
exposure  considerations  requiring  discussion  by  the  risk  assessor  and  manager,  includ¬ 
ing  the  following; 

•  Is  there  a  potential  for  the  material  to  enter  the  environment? 

•  During  what  stage  of  the  product  s  life  cycle  and  at  what  levels  does  environ¬ 
mental  release  occur? 

•  In  what  other  products  or  processes  may  the  product  or  its  ingredients  be 
discharged? 

•  If  the  product  is  emitted  into  the  environment,  does  it  have  the  potential  to 
reach  an  aquatic  environment  directly  or  indirectly? 

•  What  is  the  frequency  of  occurrence  (e.g, ,  continuous  outfall  versus  pulsed 
events)? 

•  What  is  the  level  of  exposure  per  event? 

•  Is  the  sediment  exposure  likely  to  be  an  acute  or  a  chronic  exposure  issue? 

•  Upon  entering  an  aquatic  environment,  does  the  product  have  the  potential  to 
partition  into  sediment? 

•  Do  organic  constituents  of  the  product  readily  biodegrade,  either  aerobically  or 
anaerobically? 


SETAC  Press 


28 


ECOLOGICAL  RISK  ASSESSMENT  OF  CONTAMINATED  SEDIMENTS 


•  Does  the  product  leave  the  system  through  volatilization,  hydrolysis,  photooxi¬ 
dation,  or  some  other  physical  degradation  mechanism? 

•  Is  the  receiving  sediment  disposed  toward  biodegradation? 

•  Is  it  likely  to  bind  in  the  receiving  sediment  due  to  high  organic  carbon  (OC) 
content  of  the  sediment  or  high  (high  carbon  normalized)  sediment-water 
partition  coefficient  (Kq^)? 

•  Once  in  the  sediment,  is  the  product  bioavailable? 

•  Is  the  product  lipophilic,  i.e. ,  is  the  log  octanol-water  partition  coefficient 
(log  P)  >  3?  Can  it  bio  accumulate  and  at  what  rate? 

•  Is  the  product  toxic  to  other  aquatic  species  (e.g.,  fish,  Daphnia)? 

If  it  is  determined  that  there  are  potential  risks  with  the  product  meriting  formal  risk 
assessment,  or  if  it  is  deemed  that  insufficient  data  exist  with  which  to  make  a  definitive 
decision,  the  risk  assessment  proceeds  to  the  exposure  and  effects  characterizations  and 
follows  the  “typical”  risk  assessment  paradigm. 

4.4  Tiered  testing  and  analysis 

Tiered  approaches  in  sediment  risk  assessment  of  new  products  and  chemicals  embody 
the  concept  of  an  iterative  process  with  decision  points  between  iterations,  allowing 
evaluation  of  the  adequacy  of  the  assessment  and  a  decision  either  to  continue  with  data 
collection  or  to  finalize  the  process.  Initial,  screening-level  assessments  may  lead  to  more 
sophisticated  iterations  involving  higher  level  (e.g.,  chronic  testing,  field  monitoring) 
datasets.  Needs  for  higher  level  testing  for  effects  or  exposure  assessments  may  be  consid¬ 
ered  independently;  for  example,  a  decision  could  be  made  that  effects  data  are  sufficient 
while  higher  level  exposure  information  is  required.  Results  from  each  tier  or  iteration  are 
collated,  analyzed,  and  synthesized.  Results  from  higher  level  testing  or  monitoring,  if 
needed,  are  used  to  refine  the  risk  characterization.  The  iterative  approach  is  essential  to 
efficient  product  testing  because  it  maximizes  the  use  of  personnel  and  resources  and 
minimizes  costs  to  both  the  manufacturer  and  the  regulatory  body  responsible  for  prod¬ 
uct  registration  or  approval. 

Tiered  evaluation  typically  begins  with  screening-level  assessment,  employing  conserva¬ 
tive  assumptions  and  limited  datasets  to  broadly  estimate  the  potential  for  risk.  When 
there  are  sufficient  data  to  reasonably  conclude  that  there  is  a  low  probability  for  a  chemi¬ 
cal  to  be  discharged,  reach  the  sediments,  or  exert  adverse  ecological  effects,  no  further 
or  formal  assessment  may  be  required.  This  does  not  preclude  risk  considerations  for 
wildlife,  aquatic  life,  or  human  health.  Considerations  or  triggers  for  determining  the 
need  for  additional  data  are  listed  in  Table  4-1.  Any  one  of  these  triggers  may  support  a 
decision  to  either  terminate  the  assessment  or  proceed  with  additional  testing. 

When  review  of  the  available  data  indicates  a  formalized  sediment  risk  assessment  is  jus¬ 
tified,  the  following  considerations  in  problem  formulation  are  incorporated:  goal  set¬ 
ting;  exposure  and  effects  models  that  include  the  appropriate  assessment  and 
measurement  endpoints;  criteria  to  decide  whether  enough  data  exist  to  make  a  defen- 


SETAC  Press 


Session  2:  Product  Safety  Assessment,  Chapter  4 


29 


sible  scientific  decision  on  risk;  levels  of  uncertainty;  and  regulatory,  societal,  or  corpo¬ 
rate/policy  issues.  The  approach  to  conducting  the  definitive  tiered  sediment  risk  assess¬ 
ment  is  outlined  in  Sections  4.5  and  4.6. 

4.5  Exposure  analyses 

4.5.1  Estimation  of  product  exposure  via  modeling 

The  goals  of  exposure  analysis  in  performing  a  product  evaluation  are  to  identify  and 
evaluate  various  sediment  exposure  scenarios  to  estimate  or  predict  the  levels  of  exposure 
associated  with  each.  This  results  in  what  is  classically  termed  the  estimated  or  predicted 
environmental  concentration  (EEC  or  PEC).  In  detailed  retrospective  risk  assessments, 
the  exposure  scenarios  of  greatest  interest  are  often  investigated  by  means  of  environ¬ 
mental  monitoring.  In  prospective  assessments,  exposure  models  are  applied  with  in¬ 
creasing  sophistication  in  a  tiered  approach. 

Most  products  are  complex  mixtures  of  organic  ingredients.  During  consumer  usage  or 
disposal,  products  typically  lose  their  integrity  as  ingredients,  or  components  follow 
unique  pathways  into  and  through  the  environment.  For  example,  ingredients  of  house¬ 
hold  cleaning  products  discharged  to  municipal  wastewater  treatment  are  rapidly  seques¬ 
tered  into  aqueous,  solid,  or  vapor  phases  where  they  are  individually  transported, 
transformed,  or  assimilated  into  various  environmental  media  and/or  degradation  pro¬ 
cesses.  Sediment  risk  assessments  of  products  take  into  account  this  phenomenon  by 
recognizing  the  unique  fate  and  effects  of  individual  ingredients,  their  by-products  and 
metabolites.  Results  of  whole  (intact)  product  fate-and-effects  testing  rarely  if  ever  can 
provide  a  realistic  assessment  of  the  collective  environmental  risks  of  discrete  ingredi¬ 
ents.  For  this  reason,  assessments  of  products  are  typically  conducted  on  an  ingredient- 
by-ingredient  basis. 

Exposure  data  (either  estimated  or  measured  environmental  concentrations)  are  typically 
compared  to  various  effects  endpoints  to  determine  whether  a  sufficient  margin  of  safety 
exists  (see  Section  4,5.2).  The  EEC  should  be  considered  as  a  range  of  values  rather  than 
a  single  number.  The  EEC  should  consider  not  only  the  presence  of  chemical  in  the  com¬ 
partment,  but  also  its  availability  to  the  organisms  of  concern.  For  sediment-  (and  soil-) 
borne  materials,  the  issue  of  bioavailability  is  particularly  critical  in  defining  the  exposure 
to  the  organism  tested  in  the  laboratory  as  well  as  the  organisms  in  the  field. 

Methods  for  estimating/measuring  exposure  for  the  purpose  of  developing  the  exposure 
characterization  include  the  following: 

•  Exposure  models  based  on  a  variety  of  inputs  including  physical-chemical 
properties,  environmental  conditions,  degradation  routes  (photolysis,  hydroly¬ 
sis,  biodegradation),  and  sorption  characteristics 

•  Estimates  of  model  sensitivity  and  uncertainty  in  the  exposure  scenarios 

•  Refined  modeling  approaches,  e.g. ,  the  use  of  more  sophisticated  or  data- 
intensive  models,  and/or  the  verification  of  modeling  assumptions  and  param¬ 
eters 
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•  Initial  environmental  monitoring,  if  relevant  for  retrospective  assessments, 
including  data  on  similar  products 

•  Confirmatory  environmental  monitoring 

Various  models  address  emission  routes  from  publicly  owned  treatment  works  (POTWs), 
from  manufacturing  facilities,  and  from  agricultural  fields  and  could  be  used  to  assist  in 
evaluating  exposure  scenarios  to  sediment.  These  models,  combined  with  input  from  a 
benthic  biologist  on  the  behavior  and  habits  of  benthic  organisms,  should  be  used  in 
establishing  the  routes  of  exposure  that  lead  to  definitive  effects  testing.  These  models 
vary  from  general  use  (e.g.,  Mackay  Fugacity  Models  -  Levels  I  and  II  [Mackay  1991])  to 
more  specialized  cases  used  for  POTWs  (Cowan  etal.  1995),  agricultural  products  (Baker 
et  al.  1994),  or  site-specific  models  that  may  have  been  developed  for  a  given  river  or  es¬ 
tuary  (Chapter  15).  All  of  these  models  are  based  on  a  combination  of  current  mechanis¬ 
tic  assumptions  about  partitioning  and  chemical  degradation.  In  combination  with  field 
data  describing  the  range  of  environmental  parameters  (e.g,  percent  organic  matter, 
particle  size  distribution,  partition  coefficients,  water  depth),  they  can  be  used  in  a  deter¬ 
ministic  fashion  to  address  site-specific  issues  as  well  as  to  establish  a  range  of  exposures 
that  may  occur  in  the  environment.  These  ranges  of  exposure  might  then  be  used  in  es¬ 
tablishing  the  dose  for  a  single-concentration  effects  study  or  for  studies  employing  a 
range  of  concentrations  for  a  full  effects  characterization.  While  models  are  useful  in 
guiding  the  exposure  analysis  leading  to  the  risk  assessment,  it  is  also  important  to  rec¬ 
ognize  their  limitations  and  be  sure  that  the  assumptions  are  clearly  stated  in  the  final 
exposure  assessment  presented  to  the  risk  manager. 

Distribution  models  such  as  the  Mackay  fugacity  models  (Mackay  1991)  have  been  used 
to  estimate  the  temporal  distribution  of  a  material  via  various  emission  points.  Key  input 
parameters  include  log  P,  biodegradation  rate  constants,  and  Henry’s  Law  constant  as  an 
index  of  volatility.  Distribution  between  water  and  sediment  is  calculated,  as  well  as  par¬ 
titioning  of  a  material  into  biomass.  The  models  can  be  used  to  develop  local,  regional, 
and  global  exposure  levels.  Di  Toro  etal  (1991)  have  employed  an  equilibrium  partition¬ 
ing  model  to  predict  chemical  concentrations  in  pore  water  on  the  basis  of  The  equi¬ 

librium  partitioning  approach  has  been  recommended  for  screening  purposes  for 
nonionized,  organic  materials  by  a  recent  OECD  document  on  aquatic  effects  assessment 
(OECD  1995). 

Surface  water  models  such  as  Exposure  Analysis  Modeling  System  (EXAMS),  Water 
Quality  Analysis  Simulation  Program  (WASP  5)  are  used  to  estimate  the  distribution  of 
a  material  among  the  various  compartments  of  an  aquatic  system  {e.g,  water,  sediment, 
biota).  These  models  have  been  applied  to  both  household  consumer  product  chemicals 
(Cowan  etal  1995)  and  agricultural  products  (Baker  et  al  1994). 

Runoff  models  such  as  Erosion  Productivity  Impact  Calculator  (EPIC),  Plant  Root  Zone 
Model  (PRZM  2),  Groundwater  Loading  Effects  of  Agricultural  Management  Systems 
(GLEAMS),  Simulator  for  Water  Resources  in  Rural  Basins  (SWRRB),  and  others  have 
been  used  in  the  United  States  to  predict  the  potential  levels  of  product  which  may  leave 
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a  field  during  storm-driven  runoff  events  (Baker  etal  1994).  These  models  require  addi¬ 
tional  inputs  to  describe  the  product’s  use  pattern  (rate  of  application,  application  equip¬ 
ment  method,  frequency  of  application,  time  of  year,  etc),  the  crop  (canopy  interception 
and  uptake,  crop  degradation,  water  budget),  regional  conditions  of  use  (rainfall,  water 
budget,  soil  characteristics,  etc)  and  degradation  parameters  (soil  degradation  rates). 
The  edge  of  field  loads  (both  water-  and  soil-borne  runoff)  developed  from  these  models 
are  then  entered  into  the  surface  water  models  to  determine  the  distribution  of  the  prod¬ 
uct  in  the  aquatic  system  and  the  exposures  in  the  various  aquatic  compartments,  includ¬ 
ing  sediments. 

In  addition  to  runoff  modeling,  agricultural  products  may  move  off-target  at  application. 
Models  used  to  estimate  what  those  levels  may  be  include  the  aerial  spray  drift  model 
AGDISP  and  the  DowElanco  Spray  Drift  Model  (DESDM)  (Baker  et  al  1994).  This  infor¬ 
mation  can  then  be  used  in  a  surface  water  model  to  determine  the  product’s  distribution 
in  an  aquatic  system. 

4.5.2  Exposure  characterization:  dose  considerations 

An  area  of  significant  uncertainty  in  a  sediment  risk  assessment  is  the  definition  and  ac¬ 
curate  measurement  of  the  dose  experienced  by  the  organism.  In  either  laboratory  or 
field  sediments,  the  actual  route  of  exposure  (pore  water,  whole  sediment,  sediment  par¬ 
ticle  fractions,  etc)  is  not  known,  and  the  dose  to  an  organism  is  often  difiicult  to  measure 
analytically.  Di  Toro  et  al  (1991)  have  proposed  the  use  of  equilibrium  partitioning  and 
normalization  of  exposure  based  on  OC  in  whole  sediment  for  nonionic  chemicals.  For 
some  products  and  some  organisms,  exposure  may  better  correlate  with  the  concentra¬ 
tion  related  to  total  (whole)  sediment,  a  particular  particle  size  fraction,  or  cation  ex¬ 
change  capacity.  This  is  especially  true  for  polar  and  ionic  chemicals.  For  cationic  metals, 
sediment  bioavailability/exposure  may  be  related  to  the  acid  volatile  sulfide  (AVS)  con¬ 
tent  of  the  sediments  (Di  Toro  etal  1990).  When  the  molar  concentration  of  the  AVS 
exceeds  the  molar  concentration  of  the  simultaneously  extracted  metal  (SEM),  the  bio- 
available  metal  concentration  has  been  shown  to  be  low  and  below  toxic  effects  levels.  In 
addition  to  the  bioavailability  measures  mentioned  above,  other  factors  to  consider  in 
defining  dose  include  habitat  characteristics  (substrate,  and  percent  and  type  of  organic 
matter),  organism  function  (shredders,  gatherers,  filterers),  and  exposure  through  the 
food  web  (dietary  exposure  and  trophic  transfer). 

Chemical  loads,  distribution,  and  compartment  information  can  be  used  to  set  an  upper 
level  of  exposure  for  comparison  to  existing  data  on  biological  effects  for  a  wide  variety 
of  species.  Setting  a  load  (dose)  for  a  “limit  test”  based  upon  this  comparison  with  effects 
data  and  on  some  percentage  of  hkelihood  for  that  load  to  occur  allows  for  a  first-level  or 
screening-level  assessment  of  possible  biological  effects.  Similarly,  a  range  of  exposure 
estimates  can  be  used  for  comparative  purposes  with  effects  data  and  for  setting  concen¬ 
trations  in  a  full  dose-response  effects  study  to  be  used  in  a  more  thorough  sediment  risk 
analysis.  Both  direct  and  indirect  routes  of  exposure  need  to  be  considered  when  expo¬ 
sure  characterization  is  performed  and  doses  for  toxicity  tests  are  selected. 
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4.5.3  Tiered  exposure  characterization 

A  tiered  approach  for  collecting  data  to  characterize  exposure  is  often  optimal  to  develop 
cost-effective  and  scientific  data.  A  suggested  three-tier  approach  for  characterizing  expo¬ 
sure  in  a  sequential  manner  in  conjunction  with  effects  data  is  depicted  in  Table  4-2.  Re¬ 
sults  of  the  exposure  estimates  at  each  tier  are  used  in  conjunction  with  effects  data  to 
determine  whether  or  not  there  is  need  for  further  refinement  of  the  estimates.  If  an  ad¬ 
equate  margin  of  safety  exists,  no  further  data  may  be  required.  This  decision  is  often 
made  at  the  discretion  of  the  risk  manager  with  input  from  the  risk  assessor.  Generally, 
if  a  reasonable  worst-case  exposure  estimate  does  not  exceed  the  measured  or  estimated 
chronic  effect  level  for  the  most  sensitive  species  tested,  it  is  presumed  that  an  adequate 
safety  margin  exists  and  no  further  exposure  assessment  is  needed. 

Tier  1  estimates  of  exposure  concentrations  are  based  upon  volume  of  production,  use 
and  anticipated  release  rates,  and  are  usually  obtained  using  fairly  simple  partitioning 
models  based  on  physical-chemical  properties.  Alternatively,  they  may  be  based  upon 
other  compounds  with  similar  properties  and  uses.  Tier  2  estimates  of  exposure  are  typi¬ 
cally  obtained  by  improving  the  quality  and/or  quantity  of  data  used  as  inputs  to  com¬ 
puter  models.  For  example,  site-specific  data  on  stream  flow,  sediment  type,  or  release 
rate  may  be  incorporated.  Additional  fate  data  may  be  collected  on  rates,  e.g,,  rates  of  hy¬ 
drolysis,  biodegradation,  or  photolysis.  The  models  used  may  sometimes  be  calibrated 
with  another  dataset  for  a  similar  chemical  for  which  there  are  environmental  monitor¬ 
ing  data.  Tier  3  estimates  of  exposure  may  require  field-collected  data  to  validate  models 
and  to  determine  actual  exposures  following  the  use  of  a  product.  Often  this  is  done  when 
there  is  a  potential  for  exposure  to  exceed  the  threshold  effects  levels,  i.e.,  when  the  mar¬ 
gin  of  safety  is  small.  Environmental  monitoring  may,  in  some  cases,  follow  a  period  of 
test  application  or  limited  commercial  usage  of  a  product. 

4.6  Effects  analyses 

Effects  analyses  for  benthic  organisms  should  be  carried  out  in  conjunction  with  expo¬ 
sure  analyses  to  ensure  that  similar  assumptions  are  used  and  questions  are  addressed  in 
parallel.  Both  should  be  clearly  focused  upon  the  problem  formulated  and  commensu¬ 
rate  with  the  level  of  data  required  in  each  tier.  The  effects  characterization  should  con¬ 
sider  the  duration  of  exposure  (acute  or  chronic)  and  the  potential  for  bioaccumulation. 
Chronic  assessments  typically  include  survival,  growth,  and  reproduction  endpoints. 
Additional  studies  or  measurements  may  be  needed  to  assess  bioaccumulation  end¬ 
points.  A  major  challenge  in  sediment  effects  analyses  is  identifying  and  quantifying  the 
relevant  route  of  exposure  to  a  test  material,  which  may  differ  widely  across  benthic  taxa 
(Adams  etal.  1985).  Measurement  of  exposure  requires  detailed  sampling  and  analytical 
protocols  to  obtain  reliable  data  for  correlating  effects  with  exposure  concentrations  in  a 
given  matrix. 

Like  the  exposure  analysis,  a  three-tiered  approach  that  enables  decisions  to  be  made  at 
successive  stages  and  precludes  unnecessary  testing  is  presented. 
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Table  4-2  Tiered  approach  to  collecting  data  for  risk  characterization 


Tier 

Exposure  assessment 

Decision  alternatives 

1 

Screening-level  EECs 
(computer  estimations) 

a)  Margin  of  safety  is  large;  exposure 
characterization  is  complete. 

b)  Margin  of  safety  is  small;  refine  exposure 
estimate. 

2 

Predictive  EECs;  estimates  based  on 
improved  model  simulations 

a)  Margin  of  safety  is  large;  exposure 
characterization  is  complete. 

b)  Margin  of  safety  is  small;  refine  exposure 
estimate. 

3 

Confirmed  EECs;  EEC  confirmed  by 
environmental  monitoring 

a)  Margin  of  safety  is  large;  exposure 
characterization  is  complete. 

b)  Margin  of  safety  is  small;  collect  additional 
data  or  consider  risk  mitigation  options. 

4.6.1  Tier  1 

The  first  tier  in  effects  analysis  is  a  screening  step,  which  summarizes  existing  effects  data 
(measurement  endpoints)  for  aquatic  species,  both  benthic  and  pelagic,  and  which  gen¬ 
erates  predicted  effects  data  from  relevant  QSARs.  These  data  are  used  in  conjunction 
with  PECs  from  a  Tier  1  exposure  assessment.  The  porewater  concentration  is  usually 
compared  to  the  effects  endpoint  measured  for  the  most  sensitive  species  in  the  aquatic 
effects  assessment  {e.g.,  an  acute  measurement  endpoint  [LCx,  ECx]  in  a  daphnid,  fish,  or 
benthic  invertebrate  study).  Additionally,  the  porewater  concentration  could  be  used  to 
estimate  tissue  residue  levels  in  species  of  interest  by  using  the  chemicals  log  P. 

Due  to  the  relatively  high  uncertainty  associated  with  screening-level  assessments,  assess¬ 
ment  factors  of  1, 10,  or  100  are  often  applied  by  regulatory  authorities  in  order  to  ac¬ 
count  for  extrapolations  from  acute  to  chronic  data,  laboratory  to  field  exposure 
scenarios,  and  extreme  organism  sensitivities.  The  adoption  and  magnitude  of  assess¬ 
ment  factors  used  for  screening-level  assessments  reflects  the  regulatory  policies  of  par¬ 
ticular  countries  or  regional  regulatory  authorities  {e.g,,  the  European  Union).  At  the 
present  time,  there  is  no  broad  international  consensus  on  the  use  of  assessment  factors. 

This  method  of  screening  applies  to  nonionic  organic  chemicals.  Greater  uncertainty  is 
associated  with  the  use  of  predictive  models  for  metals  and  ionic  organic  compounds 
than  for  nonionic  organics.  In  these  cases,  measured  sediment  partition  coefficient  (K^) 
values  should  be  used  to  estimate  porewater  concentrations.  In  all  cases,  the  magnitude 
of  the  application  factor  should  vary  according  to  the  uncertainty  of  the  exposure  esti¬ 
mate. 
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4.6.2  Tier  2 

For  Tier  2  effects  analyses,  direct  measurement  of  sediment  toxicity  (and  bioaccumula¬ 
tion)  in  benthic  organisms  is  often  used  for  assessment  purposes  (see  Table  4-1).  Greater 
emphasis  is  placed  on  chronic  sediment  studies  because  these  tests  better  simulate  the 
chemical  equilibria  established  between  sediment,  water,  and  tissue.  However,  chronic 
toxicity  and  bioaccumulation  test  methods  for  sediments  are  still  in  a  developmental 
phase. 

Where  possible,  standard  test  methods  with  associated  validity  or  performance  criteria 
are  employed.  Criteria  for  species  selection  have  been  outlined  in  a  SETAC-Europe  guid¬ 
ance  document  on  sediment  toxicity  tests  (Hill  et  al  1993)  and  by  the  American  Petro¬ 
leum  Institute  (API  1994). 

Recommended  test  species  include  10-d  lethality  and  growth  tests  with  chironomids  and 
amphipods  for  freshwater  applications.  Amphipods,  polychaetes,  and  bivalves  are  often 
tested  for  marine/estuarine  applications.  Standard  methods  have  been  published  or  are 
in  preparation  for  a  number  of  test  protocols  (Table  3-2).  Ideally,  test  species  with  differ¬ 
ent  feeding  strategies  should  be  used,  but  this  may  be  dictated  by  the  availability  of  stan¬ 
dard  methods  and  the  scope  of  the  assessment.  Water  column  toxicity  tests  of  sediment 
extracts  provide  an  additional  approach  for  some  sediment  assessment  applications.  Due 
to  additional  uncertainties  associated  with  extracts  preparation  and  interpretation  of 
results,  however,  whole  sediment  tests  are  generally  preferred. 

Both  formulated  and  natural  sediments  have  been  used  for  effects  analyses.  Issues  of 
standardization  within  and  between  product  assessments  yet  need  to  be  addressed.  If 
natural  sediments  are  used,  key  physical-chemical  properties  such  as  OC,  particle  size 
distribution,  ammonia,  and  pH  should  be  measured.  The  sediments  should  be  collected, 
handled,  stored,  and  dosed  according  to  existing  standard  methods  (see  Table  3-2).  The 
sediment  should  be  dosed  with  a  concentration  range  from  which  a  dose-response  may 
be  observed.  The  concentration  range  may  be  predicted  from  existing  aquatic  effects  data 
or  preliminary  range-finding  tests. 

Replication,  reference  substances,  and  control  treatments  should  be  used  according  to 
the  standard  test  method.  Measurement  endpoints  and  treatment  of  data  should  be  con¬ 
sistent  with  methods  used  in  aquatic  risk  assessment.  Ideally,  a  sufficient  amount  of  ef¬ 
fects  data  can  be  obtained  or  generated  to  allow  for  a  probabilistic  assessment  approach. 
The  USEPA  water  quality  criteria  employ  a  probabilistic  approach  that  protects  95%  of 
the  species.  SETAC  (Baker  et  al.  1994)  has  recommended  the  use  of  a  somewhat  similar 
approach  based  upon  90th  percentile  toxicity  data.  When  there  is  a  very  limited  dataset 
available,  the  most  sensitive  measure  should  be  used  for  risk  characterization. 

For  nonpolar  organics,  effect  concentrations  should  be  normalized  to  sediment  OC  con¬ 
tent.  This  would  allow  comparison  with  PECs  also  calculated  on  the  basis  of  OC.  For  met¬ 
als  and  polar  organics,  models  for  calculating  PECs  should  be  parameterized  using  the 
same  physical-chemical  properties  and  parameters  of  the  test  sediment  for  both  the  ef¬ 
fects  and  exposure  characterization.  In  many  cases,  test  sediments  representing  worst 
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case  and  typical  case  {e.g.,  low  and  average  OC  content,  respectively;  low  and  average  cat¬ 
ion  exchange  capacity;  low  and  average  AVS  concentration)  canvass  the  full  range  of  po¬ 
tential  effects.  These  tests  should  be  guided  by  the  PEC  for  sediments  and  by  the  need  for 
dose-response  data. 

4.6.3  Tier  3 

Tier  3  effects  analyses  include  higher  level  or  simulation  tests  that  may  be  used  to  assess 
application-specific  risks.  Such  tests  often  include  microcosm,  mesocosm,  or  field  stud¬ 
ies.  The  need  for  such  tests  would,  in  part,  be  dependent  upon  the  potential  for  the  prod¬ 
uct  to  remain  in  the  environment  for  extended  periods  of  time.  The  tests  are  designed  to 
address  particular  toxicity,  bioaccumulation,  or  exposure  questions  identified  in  prior 
testing.  They  may  include  alternative  approaches  for  application  of  products  {e,g,  over¬ 
spray  for  pesticides,  addition  of  slurry  or  suspended  solids  for  drilling  fluids),  exposure 
of  specific  organisms  of  concern,  alternative  exposure  regimes  such  as  in  situ  or  multispe¬ 
cies  testing,  or  assessment  of  field  effects.  Any  unique  fate  pathways,  toxicological  con¬ 
cerns  or  mechanisms,  or  action  identified  in  problem  identification  should  be  reflected 
in  the  test  design.  For  this  level  of  testing,  sediment  test  methods  are  largely  in  a  develop¬ 
mental  phase. 

4.7  Risk  characterization 

Risk  characterization  is  the  final  phase  of  sediment  assessments  for  products.  Recom¬ 
mendations  and  considerations  for  risk  characterization  in  ERA  have  been  reviewed  by 
Hoffman  et  al  (1995)  and  are  fully  relevant  to  product  assessments  of  potential  sediment 
risks.  These  are  briefly  summarized  below,  with  particular  emphasis  on  how  they  may  be 
tailored  specifically  to  products  and  sediments. 

In  the  risk  characterization,  the  exposure  and  effects  assessments  are  integrated  to  yield 
an  expression  of  the  likelihood  of  adverse  effects  to  benthic  species,  populations,  or  com¬ 
munities,  depending  on  the  nature  and  scope  of  the  risk  assessment.  If  an  additional  or 
alternative  purpose  of  the  assessment  is  to  evaluate  the  potential  for  bioaccumulation  in 
benthic  species  or  biomagnification  through  associated  food  webs,  the  risk  characteriza¬ 
tion  applies  the  exposure  estimate  to  derive  an  estimated  steady-state  tissue  concentra¬ 
tion  in  the  species  of  interest.  In  either  case,  the  objective  of  the  risk  characterization  is  to 
describe  the  risk  in  terms  of  the  assessment  endpoint  identified  in  the  problem  formula¬ 
tion  phase.  In  addition,  the  risk  assessor  provides  an  interpretation  of  the  ecological  sig¬ 
nificance  of  the  identified  risks  to  benthic  populations.  This  may  include  consideration 
of  the  integrity  of  the  benthic  population  or  community  as  well  as  possible  implications 
to  other  (non-benthic)  components  of  the  ecosystem.  However,  extrapolations  of  risks 
beyond  the  population  level  tend  to  be  speculative  when  only  single-species  effects  data 
are  available. 

Both  qualitative  and  quantitative  descriptors  are  appropriate  and  necessary  in  character¬ 
izing  risks.  To  the  extent  feasible,  the  full  range  of  risks  should  be  communicated,  includ¬ 
ing  those  likely  to  occur  in  representative  scenarios  of  sediment-mediated  exposure  as 
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well  as  those  pertaining  to  highly  exposed  or  highly  susceptible  benthic  populations.  In 
the  context  of  tiered  assessments  for  products,  the  scope  of  the  risk  assessment  must  be 
clearly  identified.  Screening-level  assessments  typically  result  in  deterministic  expres¬ 
sions  of  risk  where  only  one  or  a  few  point  estimates  of  risk  are  provided.  Higher  level 
tiers  of  assessment  may  allow  estimation  and  characterization  of  probabilistic  distribu¬ 
tions  of  exposure  and  effects,  as  summarized  by  SETAC  (Baker  etal  1994). 

Possible  outcomes  of  the  risk  assessor-risk  manager  interface  include  the  following: 

•  When  risk  probability  is  low  and  there  is  an  adequate  margin  of  safety,  the 
process  stops. 

•  When  risk  probability  is  intermediate  or  high  or  when  margins  of  safety  are 
unclear  or  unknown,  more  assessment  is  generally  needed  and  risk  mitigation 
action  may  ultimately  be  needed. 

•  When  additional  assessment  is  needed,  it  may  take  the  form  of  additional 
refinements  of  the  risk  assessment  or  verification  of  the  effectiveness  of  pro¬ 
posed  mitigation  actions. 

In  cases  where  multiple  and  independent  estimates  of  sediment  exposure  or  effects  exist 
{e.g.,  results  from  several  models  or  toxicity  data  on  multiple  species),  and  in  order  to 
account  for  seemingly  conflicting  data  due  to  variability  or  uncertainty,  a  weight-of-evi- 
dence  approach  may  be  necessary.  The  relevance  of  the  measurement  endpoints  to  the 
assessment  endpoint  should  be  described  in  the  risk  characterization.  This  may  include 
a  ranking  or  prioritization  of  measurement  endpoints.  For  example,  toxicity  data  gained 
from  direct  sediment  testing  of  a  benthic  species  may  be  considered  more  relevant  than 
toxicity  estimates  inferred  from  extrapolations  from  pelagic  test  species. 

In  retrospective  assessments  of  existing  products,  evidence  for  causal  relationships  be¬ 
tween  product  and  effects  to  benthic  communities  should  be  clearly  delineated  where 
possible.  When  the  causal  relationship  is  not  clear,  this  must  be  communicated  in  the 
sediment  risk  characterization. 

Spatial  and  temporal  distributions  of  the  risk  {e.g,  constant  versus  intermittent  exposure, 
homogeneous  versus  contagious  distribution  of  organisms,  seasonal  occurrence  of  par¬ 
ticularly  sensitive  benthic  life  forms)  should  be  identified.  The  expression  of  the  risks 
to  benthic  populations  is  largely  dictated  by  the  type  and  form  of  sediment  exposure  and 
effects  data  available.  Exposure  may  be  expressed  in  various  forms,  such  as  total  (whole) 
sediment  concentration,  particle  size  fraction  concentration,  porewater  concentration,  or 
OC-normalized  whole  sediment  concentration.  It  is  difficult  or  often  invalid  to  attempt 
extrapolations  between  or  among  different  expressions  of  exposure  due  to  the  potential 
moderating  effects  of  key  environmental  parameters  upon  exposure  or  bioavailability. 
These  limitations  in  extrapolation  should  be  clearly  communicated  in  the  risk  character¬ 
ization. 

Risk  characterizations  should  clearly  relate  to  and  communicate  the  problem  formulated 
and  the  scope  of  the  risk  assessment.  Relevant  questions  include  these: 
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•  Was  the  initial  sediment  problem  addressed? 

•  What  aspects  could  not  be  addressed,  and  do  these  limit  or  preclude  the 
estimation  of  risks  or  limit  the  scope  of  the  assessment? 

•  On  the  basis  of  the  data  and  information  obtained  in  the  effects  and  exposure 
assessment,  was  the  initial  problem  accurately  formulated? 

•  Can  the  risks  to  benthic  populations  be  expressed  in  a  quantitative  manner? 

•  Can  risk  management  or  mitigation  options  be  identified  to  reduce  potential 
risk  associated  with  product  usage,  if  necessary? 

Default  assumptions  used  in  the  effects  and  exposure  assessment  should  be  clearly  com¬ 
municated.  Extrapolations  of  the  data  (e.g.,  endpoint  to  endpoint,  chemical  to  chemical, 
species  to  species,  population  to  community)  should  be  identified.  Moderating  effects  of 
physical-chemical  characteristics  of  the  benthic  environment  (e.g.,  OC  content,  particle 
size),  limited  bioavailability  or  distribution  of  the  product,  or  biological  factors  (life  his¬ 
tory  stage  sensitivity,  behavioral  mechanisms  or  avoidance)  should  be  described. 

The  limitations  of  the  sediment  assessment,  including  sampling  biases  and  model  ex¬ 
trapolations  of  the  exposure  assessment,  should  be  clearly  identified.  Uncertainties  of  the 
effects  and  exposure  assessments  due  to  data  and  knowledge  gaps,  and  any  assumptions 
used  to  bridge  these  gaps,  should  be  described. 

For  sediment  risk  assessments  of  products,  complete  characterization  of  the  chemical 
technology  should  be  described.  Key  aspects  include  the  name  and  structure  of  the 
chemical,  chemical  mixtures,  and  impurities;  formulation,  use,  or  disposal  processes  that 
may  affect  environmental  fate,  distribution  or  bioavailability;  the  stage  of  the  product  s 
life  cycle  being  addressed  (e.g.,  sourcing,  discharges  from  manufacturing  facilities,  both 
intended  and  unintended;  consumer  use  and  disposal);  usage  volume,  both  by  individual 
corporations  and  industry-wide;  expected  degradation  and  transformation  processes  and 
metabolites;  availability  of  QSAR  data;  and  regulatory  considerations  or  requirements 
that  influence  the  design  of  the  study  design. 

Safety  factors  (i.e.,  application  factors)  have  been  conventionally  used  in  ERA  as  an  ap¬ 
proach  for  establishing  toxicity  threshold  concentrations  above  which  concern  or  risk 
may  exist.  This  is  usually  done  when  there  is  concern  that  the  available  effects  data  are 
not  truly  representative  of  the  ecosystem  or  species  at  risk  or  when  there  is  uncertainty 
associated  with  the  available  effects  data.  Safety  factors  have  also  been  used  with  limited 
effects  datasets  to  estimate  threshold  concentrations  for  the  purpose  of  determining  the 
need  for  higher  tiered  testing  or  to  assist  in  the  selection  of  a  particular  risk  management 
option.  Safety  factors  can  be  used  in  considering  risks  of  products  to  benthic  populations 
or  communities.  Safety  factors  may  be  applied  in  the  risk  characterization  for  products 
or  in  subsequent  discussions  between  the  risk  assessor  and  risk  manager,  in  order  to 
ensure  organism  protection  and  to  provide  the  risk  manager  with  a  basis  for  decision¬ 
making,  Because  the  rationale  for  and  magnitude  of  safety  factors  vary  from  one  risk  as¬ 
sessment  to  another  (and  often  from  country  to  country),  it  is  important  to  document 
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the  rationale  and  method  used  in  applying  safety  factors.  It  should  also  be  noted  that 
safety  factors  are  not  always  needed,  depending  upon  the  extent  of  the  effects  data  col¬ 
lected. 

The  risk  characterization  forms  the  basis  for  discussions  with  the  risk  manager,  whether 
within  the  private  or  public  sector.  For  this  reason,  it  is  important  that  risks  be  identified 
in  terms  relevant  to  the  assessment  endpoints  and  that  the  assumptions,  uncertainties, 
and  confidence  in  the  assessment  be  clearly  presented. 

4.8  Application  of  product  risk  assessment  for  sediments: 
integration  of  risk  assessment  and  risk  management 

The  prospective  risk  assessment  process  for  chemical  products  is  perhaps  best  viewed  as 
the  application  of  science  as  part  of  an  informed  risk  management  program.  This  view  is 
supported  by  emerging  risk-based  approaches  to  environmental  regulations,  which  pro¬ 
mote  increased  integration  of  societal  values,  science,  and  risk  mitigation  practices  for 
environmental  decision-making.  This  integrated  decision-making  process  goes  beyond 
the  traditional  scientific  boundaries  of  risk  assessment  to  include  risk  mitigation  and  risk 
management.  Within  this  context,  the  following  definitions  apply: 

•  Risk  assessment  is  a  science-based  process  that  consists  of  effects  and  exposure 
analyses  which  are  ultimately  integrated  into  a  risk  characterization. 

•  Risk  mitigation  is  an  activity  that  involves  remediation  or  mitigation  measures 
to  reduce  or  eliminate  adverse  environmental  impact. 

•  Risk  management  is  a  policy-based  approach  that  identifies  risk  assessment 
questions  and  assessment  endpoints  to  protect  human  health  and  the  environ¬ 
ment.  It  utilizes  the  risk  characterization  decisions  and  incorporates  social, 
economic,  political,  and  legal  factors  that  impinge  on  or  influence  the  final 
decision  and  selects  regulatory  actions. 

The  integrated  decision-making  process  represents  a  fundamental  shift  in  thinking  that 
focuses  attention  on  making  timely,  environmentally  protective  decisions  through  better 
integration  of  risk  assessment,  risk  management,  and  risk  mitigation  options.  For  prod¬ 
ucts  that  enter  the  marketplace  with  the  potential  to  sorb  to  sediments,  risk  characteriza¬ 
tion  can  be  integrated  with  risk  policy  decisions  in  a  number  of  ways.  This  can  include  a 
decision  of  no  risk  and  no  need  to  consider  mitigation;  conversely,  a  decision  of  moder¬ 
ate  or  significant  risk  could  be  reached,  which  would  indicate  the  need  to  consider  the 
magnitude  of  the  risk  and  ways  to  mitigate  the  potential  for  exposure.  There  are  a  num¬ 
ber  of  options  available  which  can  limit  the  potential  for  a  product  to  reach  the  environ¬ 
ment  or  which  can  influence  the  potential  for  sediment  concentrations  reaching  levels  of 
concern.  For  additional  details  on  product  mitigation,  the  reader  is  referred  to  SETAC 
(Baker  et  al  1994). 

While  scientists  are  variously  engaged  in  basic  and  applied  issues  concerning  the  devel¬ 
opment  and  refinement  of  ERA  concepts  and  practices,  they  rarely  serve  as  the  final  de¬ 
cision-makers  in  risk  management  actions.  Rather,  scientists  provide  technical  opinions 


SETAC  Press 


Session  2;  Product  Safety  Assessment,  Chapter  4 


39 


and  support  to  risk  managers  who  must  weigh  the  scientific  risk  against  societal  values 
dictated  by  economic  benefits  and  constraints,  existing  precedents  of  law  and  policy,  and 
often  conflicting  opinions  of  different  special  interest  groups.  In  some  respects,  risk  miti¬ 
gation  options  act  as  a  bridge  between  risk  assessment  and  risk  management,  requiring 
practical  solutions  and  corrective  actions  to  environmental  problems.  Recognizing  and 
understanding  that  risk  assessors  and  risk  managers  have  different  roles  and  responsibili¬ 
ties  should  go  a  long  way  toward  improving  the  decision-making  process. 

4.9  Research  needs 

There  are  several  areas  in  the  sediment  product  risk  assessment  framework  that  would  be 
improved  by  additional  research  and  development.  General  consensus  areas  include 
these: 

•  Development  of  predictive  models  for  estimating  exposure/bioavailability  for 
metals  and  ionic  substances  sorbed  to  sediments 

•  A  better  understanding  of  the  routes  of  exposure  of  benthic  invertebrates  to 
sediment-sorbed  chemicals 

•  Development  and  standardization  of  additional  chronic  sediment  toxicity  tests 

•  Identification  and  standardization  of  an  appropriate  formulated  sediment 

•  Determination  of  limits  of  acceptability  for  natural  sediments 

•  Standardization  of  sediment  spiking  techniques 

•  Development  and  standardization  of  higher  level  tests  (e.g. ,  microcosms  and 
mesocosms) 

•  International  harmonization  of  test  methodologies  and  sediment  characteriza¬ 
tions 
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5.1  Introduction 

Dredgingis  the  process  of  excavating  sediment  from  a  waterway,  often  involving  transpor¬ 
tation  of  the  excavated  dredged  material  to  another  site.  Dredging  occurs  most  often  to 
maintain  or  increase  the  depth  of  waterways  to  provide  safe  passage  for  vessels,  thus  the 
term  navigation  dredging.  Because  sediments  are  the  ultimate  reservoir  for  many  contami¬ 
nants,  sediments  can  be  the  subjects  of  environmental  concerns.  These  concerns  can  fo¬ 
cus  on  navigation  dredging  when  contaminated  sediments  occur  in  a  navigation  channel 
or  can  lead  to  remediation  dredging  of  sediments  not  associated  with  navigation  needs. 
Remediation  dredging  is  not  specifically  discussed  in  this  section  on  navigation  dredging, 
although  the  fundamental  risk  assessment  process  would  be  similar  but  would  be  con¬ 
ducted  in  a  different  risk  management  context. 

A  dredged  material  manager  must  make  decisions  regarding  the  most  appropriate  loca¬ 
tion  for  placement  of  dredged  material.  Such  decisions  are  often  constrained  by  the  ur¬ 
gency  for  dredging  and  by  the  cost  and  environmental  consequences  of  various 
placement  alternatives.  These  constraints  result  from  international  treaties,  national 
laws,  regulations  and  policies,  and  pressures  from  various  stakeholder  groups.  This  sec¬ 
tion  provides  a  brief  environmental  overview  of  navigation  dredging  and  illustrates  two 
different  regulatory  approaches  taken  in  the  United  States  and  the  Netherlands,  followed 
by  an  examination  of  the  application  of  ERA  methodology  to  dredged  material  evalua¬ 
tions. 

5.2  Existing  approaches  for  evaluating  dredged  material 

5.2.1  Overview 

Contaminants  enter  aquatic  systems  through  a  variety  of  point  and  nonpoint  sources  and 
in  time  may  become  associated  with  sediments  by  a  variety  of  chemical  or  physical 
mechanisms.  Dredged  material  encompasses  a  wide  range  of  sediment  types  that  vary  in 
physical  and  chemical  properties  and  amount  of  both  anthropogenic  pollutants  and 
naturally  occurring  toxicants  {e.g,  NH^,  H^S  that  may  also  be  enriched  in  polluted  sedi¬ 
ments).  Sediment-contaminant  interactions  are  both  complex  and  dynamic.  Geochemi¬ 
cal  processes  such  as  partitioning  and  organic  complexation  directly  affect  the  biological 
availability  of  sediment-associated  contaminants.  Only  the  biologically  available 
(bioavailable)  fraction  is  relevant  to  the  actual  contaminant  exposure  that  may  ultimately 
elicit  adverse  ecosystem  impacts. 
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When  sediment  contamination  is  an  issue  in  navigation  dredging,  it  is  rare  for  there  to  be 
only  a  single  contaminant  of  concern;  typically,  there  is  a  complex  mixture  of  several 
classes  of  contaminants  (e.g„  metals,  polycyclic  aromatic  hydrocarbons  [PAHsl,  chlori¬ 
nated  compounds)  that  interact  with  the  segment  matrix  in  complex  ways.  The  compo¬ 
nents  and  relative  concentrations  in  the  mixture,  and  the  interactions  with  the  sediment 
matrix,  differ  from  sediment  to  sediment.  Toxicity  reference  values  and  other  quantifiers 
of  effect  in  risk  assessments  have  almost  always  been  developed  on  a  single  chemical 
basis:  the  effect  of  each  chemical  acting  alone  is  evaluated,  and  these  effects  are  usually 
either  assumed  to  be  independent  or  they  are  combined  by  some  mathematical  ap¬ 
proach.  At  the  same  time,  sediment  toxicity  and  other  bioeffects  tests  in  most  dredged 
material  evaluations  have  typically  been  whole  sediment  tests  that  measured  the  total 
effect  of  the  entire  “black  box”  of  contaminants  and  sediment  matrix  interactions,  with 
no  ability  to  attribute  effect  to  a  specific  cause  other  than  to  say  “this  sediment  caused 
this  effect  in  this  test”  (however,  see  Chapter  16). 

When  navigation  dredging  is  considered,  it  is  typically  true  that  the  sediment  under 
evaluation  will  be  dredged  regardless  of  its  contamination  status;  economic  factors  usu¬ 
ally  force  dredging,  and  the  issue  becomes  one  of  environmentally  responsible  manage¬ 
ment  of  the  dredged  material.  The  dredged  material  could  theoretically  be  placed  either 
on  land  or  at  another  location  in  the  water.  The  physical-chemical  conditions  affecting 
bioavailability  of  contaminants  and  the  receptors  of  concern  are  quite  different  on  land 
and  in  the  water,  resulting  in  different  kinds  and  degrees  of  risk  from  the  same  dredged 
material,  depending  on  whether  it  were  to  be  placed  in  a  land  or  water  environment. 
Even  though  such  cross-media  risk  considerations  are  crucial  to  environmentally  sound 
dredged  material  management,  they  substantially  broaden  the  issues  to  be  considered 
and  are  beyond  the  scope  of  this  chapter,  which  considers  the  application  of  risk  assess¬ 
ment  to  dredged  material  placed  in  an  aquatic  environment. 

The  entire  issue  of  risk  management  is  fundamental  to  environmental  protection  and  is 
the  primary  reason  risk  assessments  are  being  considered  in  the  context  of  navigation 
dredging.  However,  risk  management  is  a  subject  unto  itself  and  is  not  included  in  this 
discussion,  except  to  note  the  necessity  of  involving  risk  managers  and  all  appropriate 
stakeholders  in  the  problem  formulation  process  (Section  5.3.2)  to  avoid  conducting  a 
risk  assessment  of  little  practical  use  due  to  improper  focus. 

This  chapter  reviews  the  application  of  scientific  understanding  and  technology  in 
dredged  material  decision-making  within  the  regulatory  and  management  constraints  of 
the  United  States  and  the  Netherlands.  This  review  is  used  as  a  context  for  applying  the 
ERA  framework  to  dredged  material  operations  and  in  making  conclusions  and  recom¬ 
mendations.  The  remainder  of  this  section  summarizes  the  existing  approach  to  evaluat¬ 
ing  dredged  material  for  open-water  disposal  in  the  U.S.  and  the  Netherlands,  especially 
focusing  on  the  relationship  between  management  constraints  and  scientific  understand¬ 
ing  in  dredged  material  decision-making. 
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5.2.2  United  States  approach 

Ports  and  harbors  and  their  associated  waterways  play  a  vital  role  in  U.S.  economic,  de¬ 
fense,  and  recreational  interests.  In  1992,  U.S.  ports  handled  2.9  billion  metric  tons  of 
cargo  for  a  total  import/export  value  of  $487  billion  and  supported  approximately  15 
million  jobs,  while  more  than  94  million  Americans  participated  in  some  form  of  recre¬ 
ational  boating  or  fishing  activity  (American  Association  of  Port  Authorities  1994).  Main¬ 
taining  adequate  navigation  depths  in  ports  and  waterways  is  fundamental  to  national 
interests. 

As  sediments  are  transported  and  settle  in  channels  and  basins,  periodic  maintenance 
dredging  is  required  to  insure  safe  passage  for  shipping.  Excavated  sediments  or  dredged 
materials  are  removed  from  channels  and  transported  to  another  location.  There  are 
more  than  4000  km  of  navigation  channels  and  over  400  harbors  in  the  United  States 
that  may  require  maintenance  dredging  in  any  one  year.  This  activity  generates  approxi¬ 
mately  300  million  m^  of  dredged  material  that  must  be  relocated  annually.  About  80%  of 
dredged  material  is  moved  to  other  designated  sites  in  the  aquatic  environment. 

5.2.2. 7  Regulatory  context 

Dredging  and  placement  of  dredged  material  are  regulated  in  accordance  with  a  number 
of  environmental  statutes  including  the  National  Environmental  Policy  Act  of  1969 
(NEPA),  the  Clean  Water  Act  (CWA),  and  the  Marine  Protection,  Research,  and  Sanctu¬ 
aries  Act  of  1972  (MPRSA).  In  addition,  the  United  States  is  signatory  to  the  London 
Convention,  which  governs  the  disposal  of  material  in  ocean  waters.  The  U.S.  Army 
Corps  of  Engineers  (USACE)  has  primary  responsibility  for  all  dredged  material  permit¬ 
ting  activities  as  well  as  responsibility  for  all  aspects  of  federal  navigation  dredging  activi¬ 
ties.  The  USACE,  using  federal  funds,  performs  the  dredging  necessary  to  maintain 
commercial  waterways  throughout  the  United  States.  Any  party  (port  authority,  indus¬ 
trial  facility,  marina,  etc.)  that  wishes  to  access  the  federal  channel  must  obtain  a  permit 
from  the  USACE  to  dredge  their  own  berths,  turning  basins,  and  access  channels.  Many 
dredged  areas  experience  rapid  sedimentation  and  require  relatively  frequent  dredging 
to  maintain  safe  depths  for  navigation.  Most  dredging  projects  require  environmental 
evaluation,  public  notice  and  comment,  opportunity  for  a  public  hearing,  and  permit 
issuance.  The  USEPA  is  responsible  for  developing  environmental  criteria  and  guidelines 
used  by  the  USACE  for  permit  evaluation  under  the  CWA  and  MPRSA,  the  two  major 
statutes  governing  the  disposal  of  dredged  material.  USEPA  is  also  responsible  for  envi¬ 
ronmental  oversight  of  dredging  projects  and  has  the  authority  to  veto  projects  it  deter¬ 
mines  not  to  be  in  compliance  with  environmental  requirements. 

The  environmental  policy  goals  for  dredged  material  management  are  established  in  the 
CWA  and  the  MPRSA.  While  the  language  in  these  laws  differs  slightly,  the  underlying 
concept  in  both  laws  is  that  placement  of  dredged  material  result  in  “no  unacceptable 
adverse  affects.”  The  general  policy  considerations  in  the  two  laws  provide  neither  unam¬ 
biguous  statements  of  the  elements  of  the  ecosystem  that  are  to  be  protected  nor  quan- 
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titative  or  qualitative  descriptions  of  the  nature  of  protection  that  is  to  be  afforded  those 
elements. 

NEPA,  CWA,  and  MPRSA  and  their  implementing  regulations  (40  U.S.  Code  of  Federal 
Regulations  [CFR]  1500-1508, 40  CFR  230  and  40  CFR  220-228,  respectively)  each  re¬ 
quires  evaluation  of  human  health  and  environmental  impacts  resulting  from  proposed 
projects,  comparison  with  other  alternatives,  and  an  opportunity  for  public  review  and 
comment.  In  1992,  the  USEPA  and  USAGE  issued  a  generalized,  consistent,  technical 
framework  document  (USEPA  and  USAGE  1992).  This  document  is  used  to  evaluate  en¬ 
vironmental  impacts  of  dredged  material  placement  in  open  water,  confined  disposal 
facilities,  or  beneficial  uses  listed  above.  This  document  describes  the  exposure  pathways 
for  each  option  that  should  be  assessed  to  determine  the  potential  for  contaminants  in 
dredged  material  to  cause  unacceptable  adverse  impacts  to  human  health  or  the  environ¬ 
ment.  In  general,  the  evaluations  are  concerned  with  the  potential  for  toxicity  and  for 
long-term  or  secondary  effects  due  to  bioaccumulation  of  contaminants  in  the  dredged 
material. 

The  three  principal  components  of  dredged  material  management  embodied  in  the  regu¬ 
lations  are  1)  site  selection  and  designation,  2)  dredged  material  evaluation  (permitting), 
and  3)  site  monitoring.  The  site  designation  process  consists  of  baseline  studies  of  the  en¬ 
virons  of  the  proposed  site.  The  site  with  the  least  potential  for  unacceptable  adverse 
impact  is  designated  as  an  acceptable  location  for  placement  of  dredged  material  that  is 
judged  acceptable  for  aquatic  placement  in  the  dredged  material  evaluation  or  permitting 
process.  The  permitting  process  requires  a  detailed  evaluation  of  the  specific  dredged 
material  under  consideration  in  accordance  with  regulatory  criteria  or  guidelines  and 
with  technical  evaluative  procedures  developed  jointly  by  USEPA  and  USAGE  (USEPA 
and  USAGE  1991, 1994).  Site  monitoring  serves  as  a  feedback  mechanism  to  ensure  that 
the  evaluative  procedures  are  appropriate  to  maintain  the  environmental  integrity  of  the 
site.  These  three  components  can  be  thought  of  as  a  management  continuum;  they  act 
together  to  ensure  that  dredged  material  from  multiple  projects  is  evaluated  and  man¬ 
aged  in  an  environmentally  acceptable  manner. 

USEPA  and  USAGE  have  jointly  developed  evaluative  procedures  for  determining  the 
suitability  of  dredged  material  for  placement  at  aquatic  sites.  Because  of  the  complex 
nature  of  sediment-contaminant  interactions  and  the  fact  that  contaminated  dredged 
materials  can  contain  complex  mixtures  of  a  multitude  of  toxicants,  the  GWA,  MPRSA, 
and  USEPA  and  USAGE  (1991, 1994)  guidances  focus  on  effects-based  evaluations.  The 
primary  evaluative  endpoints  are  toxicity  and  bioaccumulation  potential  of  sediment- 
associated  contaminants. 

The  current  guidance  manuals  (USEPA  and  USAGE  1991, 1994)  utilize  a  tiered  approach 
designed  to  proceed  from  simple,  cost-effective  evaluations,  which  take  advantage  of 
available  information,  to  more  complex  and  costly  assessments  that  provide  more  de¬ 
tailed  answers.  An  evaluation  proceeds  through  the  tiers  until  the  necessary  and  suffi- 
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cient  information  is  developed  to  make  a  permit  decision.  The  tiered  hierarchy  may  be 
entered  at  any  point,  providing  the  required  information  is  available. 

Tier  I  is  primarily  an  evaluation  of  existing  data.  However,  in  most  cases,  a  more  complete 
chemical  characterization  of  the  dredged  material  will  have  to  be  generated.  In  many 
cases,  a  permit  decision  can  be  made  in  Tier  I,  thus  providing  a  timely  and  cost-effective 
regulatory  decision.  However,  in  dredged  material  evaluations  involving  concerns  about 
contaminants.  Tier  I  will  typically  indicate  that  further  testing  in  subsequent  tiers  is  war¬ 
ranted. 

Tier  II,  III,  and  IV  test  procedures  and  evaluations  depend  on  use  of  a  reference  sediment. 
The  reference  sediment  concept  implements  the  regulatory  requirement  that  there  be 
“no  unacceptable  adverse  impact.”  Reference  sediment  is  defined  as  a  sediment  that  is 
substantially  free  of  contaminants,  is  as  similar  to  the  grain  size  of  the  dredged  material 
and  the  sediment  at  the  site  as  possible,  and  reflects  conditions  at  the  environs  of  the  site 
in  the  absence  of  dredged  material  placement  (USEPA  and  USAGE  1991).  Both  the  refer¬ 
ence  sediment  and  the  dredged  material  are  tested,  and  the  dredged  material  is  evaluated 
in  relation  to  the  reference  material.  In  concept,  if  the  dredged  material  does  not  produce 
greater  response  than  the  reference  material,  “no  unacceptable  adverse  impacts”  are  ex¬ 
pected. 

Tier  II  is  designed  to  take  advantage  of  predictive  assessment  models.  When  sediment 
quality  criteria  are  fully  developed  and  adopted  for  regulatory  use  they  may  be  applied  in 
Tier  II  as  part  of  the  permit  decision  process.  Currently  a  number  of  modeling  ap¬ 
proaches  (USEPA  and  USAGE  1991)  are  recommended  for  use  in  Tier  II,  including  the 
Automated  Dredging  and  Disposal  Alternatives  Management  System  (ADDAMS)  collec¬ 
tion  of  models,  which  provide  predictive  assessment  of  physical  behavior  and  potential 
water  quality  impacts,  and  the  Theoretical  Bioaccumulation  Potential  (TBP)  model, 
which  is  used  to  assess  bioaccumulation  potential  of  nonpolar  organic  contaminants  of 
concern  associated  with  the  dredged  material.  Permitting  decisions  can  be  reached  on  the 
basis  of  comparisons  to  reference  sediment  in  Tier  II,  or  further  evaluation  in  subsequent 
tiers  may  be  necessary. 

The  Tier  III  protocols  consist  of  water  column  and  benthic  toxicity  tests  and  whole  sedi¬ 
ment  bioaccumulation  tests.  Tier  III  water-column  toxicity  testing  addresses  the  acute 
toxicity  of  both  suspended  and  dissolved  fractions  of  the  dredged  material  remaining  in 
the  water  column  following  a  4-h  period  to  allow  for  initial  mixing.  Tier  III  benthic  tests 
are  conducted  with  appropriate,  sensitive,  benthic  marine  organisms  to  determine  the 
potential  for  toxicity  and  bioaccumulation  of  contaminants  from  the  dredged  material. 
Dredged  material  is  considered  unsuitable  for  unrestricted  placement  at  aquatic  sites  if 
the  dredged  material  toxicity  is  statistically  greater  than  the  reference  sediment  toxicity 
and  exceeds  the  reference  sediment  toxicity  by  at  least  20%.  Bioaccumulation  potential  of 
contaminants  associated  with  dredged  material  is  evaluated  by  comparing  contaminant 
concentrations  in  organisms  exposed  to  reference  sediment  with  concentrations  in  the 
same  species  exposed  to  dredged  material  following  28  days  of  exposure.  If  the  dredged 
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material  produces  concentrations  below  U.S.  Food  and  Drug  Administration  (FDA)  Ac¬ 
tion  Levels  for  Deleterious  Substances  and  below  reference  sediment  results,  the  dredged 
material  satisfies  the  bioaccumulation  endpoint  (USEPA  and  USAGE  1991).  If  the 
dredged  material  results  exceed  the  FDA  values,  the  dredged  material  is  considered  un¬ 
acceptable  for  unrestricted  placement  at  an  aquatic  site.  When  dredged  material  results 
are  below  FDA  values  but  above  reference  results,  interpretation  becomes  more  problem¬ 
atic.  While  the  regulations  are  concerned  about  the  potential  for  bioaccumulation,  the 
permit  may  be  denied  only  if  the  bioaccumulation  is  likely  to  cause  an  unacceptable  ad¬ 
verse  effect.  Therefore,  case-specific  criteria  that  reflect  local  conditions  are  used  to  en¬ 
hance  the  interpretative  framework  for  bioaccumulation  test  results.  Tier  III  is  intended 
to  be  sufficient  for  most  dredged  material  evaluations. 

Tier  IV  is  intended  for  use  only  in  those  instances  where  decisions  cannot  be  made  in  Tier 
III  due  to  lack  of  adequate  data.  Tier  IV  tests  consist  of  chronic  sublethal  sediment  toxic¬ 
ity  tests  and  steady-state  bioaccumulation  tests  which  account  for  long-term  effects  of  ex¬ 
posure  to  dredged  material.  Tier  IV  methodology  is  primarily  a  research  and 
development  activity  at  this  time,  and  the  guidance  requires  that  case-specific  testing  and 
interpretive  procedures  be  agreed  upon  by  USEPA  and  USAGE  before  any  Tier  IV  testing 
is  used  in  the  regulatory  program. 

Risk'based  techniques  in  current  U,S,  regulatory  evaluations 

Risk  assessment  concepts  and  terms  have  seldom  been  consciously  used  in  dredged  ma¬ 
terial  regulatory  evaluations.  Risk  assessment  approaches  are  clearly  within  the  scope  of 
the  regulations  and  may  offer  the  best  opportunity  for  refinement  of  dredged  material 
regulatory  evaluations.  While  seldom  recognized  as  such,  the  present  procedure  for 
evaluating  water  column  impacts  (USEPA  and  USAGE  1991)  is  an  application  of  the  ge¬ 
neric  ERA  paradigm  to  a  complex  mixture  of  chemicals  in  the  matrix  of  a  specific  sedi¬ 
ment.  The  effects  assessment  consists  of  determining  the  toxicity  of  a  whole  sediment 
suspension,  which  generally  approximates  the  components  of  the  dredged  material  that 
might  remain  in  suspension  around  an  aquatic  discharge.  Responses  are  observed  peri¬ 
odically  throughout  the  test,  and  LG50  values  are  calculated  at  the  end  of  each  time  pe¬ 
riod.  The  LG50  concentrations  are  plotted  through  time,  producing  a  time-mortality 
curve  (Figure  5-1).  The  equivalent  of  exposure  assessment  in  the  risk  assessment  para¬ 
digm  is  evaluated  using  a  numerical  model  to  calculate  short-term  spatial-temporal  dis¬ 
tributions  of  dredged  material  in  the  water  column.  Model  output  is  plotted  on  the  same 
graph  as  the  LG50  values,  producing  a  time-concentration  curve  (Figure  5-1).  Risk  char¬ 
acterization  consists  of  comparing  the  mortality  curve  to  the  concentration  curve  to  de¬ 
termine  whether  exposure  concentrations  are  expected  to  exceed  effects  concentrations. 
Thus,  risk  assessment  using  direct  measurement  of  biological  effects  has  been  applied  in 
dredged  material  evaluations  since  1977. 

No  equivalent,  straightforward  application  of  the  risk  assessment  paradigm  has  been 
used  for  evaluating  ecological  risk  to  benthic  biota  as  a  result  of  exposure  to  deposited 
dredged  material  (however,  see  Ghapter  16  for  a  possible  approach  to  this  issue).  This  has 
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ELAPSED  TIME,  HOURS 


Figure  5-1  Illustration  of  risk  characterization  based  on  toxicity  of  a  complex  mixture  of  chemicals.  The 
effects  measurement  endpoint  (mortality  curve)  is  compared  to  the  exposure  measurement 
endpoint  (dilution  curve)  to  determine  risk.  Reproduced  from  USEPA  and  USAGE  (1977). 


been  a  severe  gap  because  water  column  impacts  associated  with  dredged  material  dis¬ 
charges  typically  are  ephemeral  and  minor  relative  to  potential  benthic  impacts  associ¬ 
ated  with  long-term  exposure  to  deposited  sediment. 

The  water  column  procedure  notwithstanding,  most  regulatory  evaluations  of  dredged 
material  in  the  U.S.  have  not  taken  advantage  of  the  risk  assessment  concept.  They  gen¬ 
erally  consist  largely  of  effects  assessment  to  the  virtual  exclusion  of  exposure  assessment 
and  risk  characterization.  Yet  risk  assessment  is  inherently  compatible  with  the  regula¬ 
tory  requirements,  as  demonstrated  by  the  longstanding  use  of  the  risk  assessment  con¬ 
cept  in  water  column  evaluations.  An  approach  by  which  the  entire  dredged  material 
evaluation  process  could  be  conducted  on  a  risk  assessment  basis  is  presented  and  dis¬ 
cussed  in  Section  5.3. 
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5.2.3  The  Netherlands  approach 

Dredged  material  management  policies  in  the  Netherlands  grew  out  of  regulatory  and 
social  backgrounds  and  political  pressures  different  from  those  in  the  United  States  and 
therefore  approach  the  problem  differently.  Geographical  differences  have  had  a  major 
influence:  most  dredging  in  the  Netherlands  involves  sediments  associated  with  the 
Rhine  River  estuary,  while  the  U.S.  regulations  must  be  applicable  to  a  wide  variety  of 
sediments  and  circumstances  encountered  throughout  the  country.  The  general  aims  of 
the  dredged  material  management  policies  in  the  Netherlands  include  these: 

•  Protection  of  the  structure  and  function  of  the  ecosystem,  at  least  theoretically 
protecting  95%  of  the  species,  including  endangered  species  and  other  species  of 
concern 

•  Application  of  the  “stand  still”  principle,  i.e.,  no  further  deterioration  of 
receiving  environments 

•  Minimization  of  the  amount  of  dredged  material  placed  at  aquatic  sites, 
regardless  of  the  degree  of  contamination 

•  Reduction  of  contaminant  inputs  based  on  international  agreements 

5.2.3. 7  Short-term  policies 

Short-term  policies  are  based  mainly  on  chemical  analyses  of  about  40  priority  pollutants 
relevant  to  sediment  contamination  in  the  Netherlands.  Based  on  extensive  research  on 
biological  responses  to  contaminated  sediment,  sediment  chemistry-based  regulatory 
values  have  been  established.  Dredged  material  that  exceeds  upper  contaminant  levels  is 
placed  in  diked  containment  areas.  Dredged  materials  that  do  not  exceed  lower  thresh¬ 
olds  may  be  placed  at  aquatic  sites.  Dredged  materials  with  contaminant  concentrations 
between  the  upper  and  lower  levels  may  be  used  for  beneficial  purposes  or  placed  at 
aquatic  sites  as  long  as  the  “stand  still”  principle  is  adhered  to. 

Dredged  material  in  the  intermediate  category  is  evaluated  in  relation  to  more  detailed 
criteria  in  order  to  further  reduce  the  aquatic  placement  of  moderately  polluted  sedi¬ 
ments.  In  addition  to  the  chemical  analyses-based  quality  assessment,  the  use  of  toxicity 
tests  is  being  considered  to  identify  dredged  materials  that  are  toxic  even  though  they  do 
not  exceed  the  lower  threshold  chemical  concentrations.  Such  materials  would  be  placed 
in  diked  containment  areas.  A  cost-effective,  stepwise  approach  is  being  followed  in 
implementing  the  toxicity-based  program.  In  the  Netherlands’  regulatory  program,  tox¬ 
icity  tests  are  useful  only  when  the  results  might  change  a  decision  that  was  based  upon 
chemical  analyses  of  all  dredged  materials.  Monitoring  confirms  that  contaminants  of 
concern  are  properly  identified  and  that  potential  contaminants  are  considered  in  future 
dredged  material  evaluations. 

5.2.3.2  Long-term  policies 

Long-term  policies  call  for  further  reduction  of  the  placement  of  dredged  material  at 
aquatic  sites  due  to  concerns  about  possible  implications  of  contaminants  and  interna¬ 
tional  agreements  on  dredged  material  management.  In  the  meantime,  point  and 
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nonpoint  source  control  should  lead  to  less  contaminated  dredged  materials,  preferably 
to  the  point  that  placement  in  diked  containment  areas  would  not  be  required. 

5.2.3.3  Reguiatory  approach 

Chemical  analyses  of  whole  sediment  are  performed  for  about  40  contaminants  of  con¬ 
cern,  and  results  are  compared  to  the  sediment  quality  values  established  for  each  con¬ 
taminant  of  concern.  Dredged  material  is  considered  acceptable  for  placement  at  aquatic 
sites  if  none  of  the  sediment  quality  values  is  exceeded.  The  sediment  quality  values  are 
assumed  to  cover  both  direct  and  indirect  (bioaccumulation)  exposure-effect  estimates. 

Toxicity  tests  are  conducted  on  dredged  materials  with  chemical  concentrations  in  the 
intermediate  category.  If  severe  toxicity  is  revealed  in  any  test,  placement  at  aquatic  sites 
is  not  allowed.  If  necessary,  bioaccumulation  experiments  are  performed  for  specific 
compounds.  Tissue  levels  are  compared  to  values  derived  from  sediment  quality  guide¬ 
lines.  The  bioaccumulation  assays  are  also  used  to  check  deviations  from  the  EqP  theory 
used  to  derive  sediment  quality  values. 

Field  surveys  of  benthic  communities  at  aquatic  sites  are  carried  out  through  time.  The 
monitoring  does  not  influence  dredged  material  placement  decisions  but  merely  indi¬ 
cates  needs  for  further  identification  of  contaminants  of  concern. 

In  order  to  set  new  sediment  regulatory  values,  risk-based  evaluations  of  selected  con¬ 
taminants  of  concern  are  carried  out,  using  sophisticated  exposure  models  and  effects 
estimates. 

Sediment  quality  values  for  dissolved,  suspended,  and  deposited  contaminants  are  based 
upon  ecotoxicological  data  and  apply  to  freshwater  as  well  as  to  saltwater  environments. 
Sediment  quality  values  are  in  principle  equal  to  soil  values,  but  compartment-specific 
risk  evaluations  are  carried  out.  Criteria  are  normalized  for  OC  content  and  particle  size 
distribution.  Sediment  quality  values  are  derived  from  spiked  whole-sediment  toxicity 
tests  and  tests  of  dissolved  contaminants.  Data  from  dissolved  contaminant  tests  are 
translated  into  sediment  values  using  the  EqP  theory.  For  metals,  field-derived  partition¬ 
ing  constants  are  used. 

If  necessary,  sediment  samples  are  treated  to  fulfill  criteria  on  confounding  factors  like 
ammonia,  pH,  and  oxygen,  which  represent  the  tolerance  limits  of  the  specific  test  spe¬ 
cies.  Responses  in  effect  parameters  are  compared  to  those  in  control  and  in  reference 
sediments  from  near  the  site,  resembling  the  characteristics  of  the  dredged  material  of 
concern.  However,  reference  sediments  often  appear  not  to  be  free  from  contaminants 
and  effects.  Responses  are  classified  as  no  effect,  moderate  effect,  and  severe  effect  based 
upon  the  most  sensitive  parameter  per  species  (survival,  growth,  reproduction,  lumines¬ 
cence,  morphological  abnormalities). 

5.2.3.4  Linking  bioassay  responses  to  sediment  quality  objectives 

Short-term  acute  toxicity  testing  cannot  be  a  basis  for  classifying  a  dredged  material  as 
acceptable  for  placement  at  an  aquatic  site,  but  it  can  be  a  basis  for  declaring  a  sediment 
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unacceptable.  If  chronic  toxicity  testing  shows  toxicity  to  only  one  appropriate  species, 
the  dredged  material  may  presently  be  placed  at  an  aquatic  site,  but  such  placement  will 
become  unlikely  in  the  future  as  policies  become  more  restrictive.  Dredged  material  that 
shows  no  adverse  biological  effects  is  regarded  as  acceptable  for  placement  at  aquatic 
sites  and  will  remain  so  in  the  future.  For  these  sediments,  monitoring  at  the  placement 
site  is  an  important  tool  to  confirm  the  dredged  material  management  decision-making 
process. 

5.3  Application  of  risk  assessment  methodology  to  dredged 
material  evaluations 

5.3.1  Overview 

The  basic  risk  assessment  concept  includes  fundamental  components  that  can  be  sum¬ 
marized  very  briefly  as  follows.  Each  will  be  discussed  in  more  detail  in  the  context  of 
navigation  dredging  in  the  remaining  sections  of  this  description  of  the  application  of 
risk  assessment  methodology  to  dredged  material  evaluations. 

•  Problem  formulation  (Section  5.3.2)  in  which  the  problem  is  defined,  potential 
stressors  are  identified,  and  the  application  of  the  risk  assessment  framework  to 
the  specific  situation  is  structured 

•  Effects  assessment  (Section  5.3.3)  in  which  the  nature  of  the  potential  effects  of 
the  stressors  and  dose-response  relationships  are  determined 

•  Exposure  assessment  (Section  5.3.4)  in  which  the  nature  of  the  exposure  to  the 
stressors  is  determined 

•  Risk  characterization  (Section  5.3.5)  in  which  expected  exposure  conditions  are 
compared  with  conditions  necessary  to  cause  effects,  to  determine  whether 
effects  would  be  likely  to  occur 

In  the  past,  most  evaluations  of  dredged  material  for  management  purposes  focused  on 
the  dredged  material  itself.  Evaluation  of  the  placement  site  was  often  a  secondary  issue, 
usually  treated  more  or  less  separately  from  evaluations  of  the  individual  materials  to  be 
placed  at  the  site.  In  reality,  characteristics  of  both  the  site  and  the  materials  placed  there 
are  inextricably  involved  in  determining  the  environmental  impacts  that  could  occur. 

Both  exposure  and  effect  must  exist  before  risk  can  occur.  In  relation  to  navigation  dredg¬ 
ing,  exposure  occurs  at  the  placement  site  (except  in  evaluations  of  the  “no  action  alter¬ 
native”).  Therefore,  the  potential  risks  of  a  particular  dredged  material  can  be 
determined  only  in  the  context  of  the  proposed  placement  site  (or  its  present  location  in 
evaluations  of  the  “no  action  alternative”)  and  the  exposure  conditions  existing  at  that 
site.  Effects  tests  of  the  material  are  a  necessary  but  insufficient  component;  risk  can  be 
determined  only  when  the  effects  information  is  considered  in  conjunction  with  the  ex¬ 
posure  conditions  at  the  placement  site.  Therefore,  exposure  assessment  at  the  place¬ 
ment  site  is  essential  if  risk  assessment  is  to  be  applied  to  navigation  dredging.  This 
principle  has  not  been  commonly  recognized  in  dredged  material  evaluation  and  under- 
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lies  the  remainder  of  this  discussion  of  the  application  of  risk  assessment  to  navigation 
dredging. 

Although  risk  management  discussions  are  beyond  the  scope  of  this  document,  a  corol¬ 
lary  of  the  above  principle  must  be  mentioned  here  for  completeness.  Exposure  at  the 
placement  site  is  key,  not  only  for  risk  assessment  but  also  for  risk  management.  The 
potential  toxicity  and  other  effects  of  a  particular  dredged  material  are  characteristic  of 
the  material  and  cannot  be  easily  altered.  Exposure,  however,  can  be  controlled  by  a  va¬ 
riety  of  proven  techniques,  including  use  of  a  different  placement  site,  various  controls  on 
the  placement  process,  and  engineering  techniques  such  as  capping  to  reduce  exposure. 

The  value  of  risk  assessment  cannot  be  realized  if  it  is  conducted  in  a  vacuum.  It  is  im¬ 
perative  that  the  risk  assessment  be  fully  integrated  with  risk  management  needs  from 
the  very  earliest  stages.  Only  in  this  way  can  the  entire  process  lead  to  optimal  environ¬ 
mental  decisions.  The  remaining  discussion  relates  to  application  of  risk  assessment  to 
evaluation  of  dredged  material  proposed  for  placement  at  an  aquatic  (subaqueous)  site, 
which  may  also  receive  other  dredged  materials  from  other  locations  before,  while,  or 
after  the  material  in  question  is  placed  there  (a  multi-user  site).  Dredged  material  is  de¬ 
scribed  in  Section  5.2  as  a  “complex  mixture”  of  sediments  and  chemicals.  Dredged  ma¬ 
terial  placed  at  a  multi-user  site  in  fact  becomes  part  of  a  “complex  mixture  of  complex 
mixtures.”  Environmental  exposure  at  a  multi-user  site  is  the  most  important  issue  for 
application  of  risk  assessment  to  dredged  material  evaluations  and  is  the  focus  of  the 
following  discussion.  Therefore,  the  discussion  does  not  address  evaluation  of  the  “no 
action  alternative”  or  placement  at  upland  or  other  sites.  Neither  does  this  discussion 
consider  evaluation  of  the  dredging  operation  itself,  as  distinct  from  the  placement  of  the 
dredged  material.  However,  the  basic  principles  are  applicable  to  all  dredged  material 
evaluations  and  can  readily  be  modified  as  appropriate  for  use  in  a  wide  variety  of  con¬ 
texts.  Similarly,  this  SETAC  Special  Publication  focuses  on  ERA,  but  many  of  the  basic 
principles  are  equally  appropriate  to  HHRA. 

5.3.2  Problem  formulation 

Dredged  material  disposed  into  an  aquatic  system  can  potentially  affect  the  environment 
through  a  number  of  different  pathways.  This  section  examines  the  stressors  associated 
with  ecological  risk,  identifies  the  types  of  potential  ecological  receptors  that  risk  assess¬ 
ment  should  focus  on,  and  indicates  the  types  of  assessment  endpoints  that  should  be 
used  to  analyze  ecological  risk.  Two  things  should  be  kept  in  mind.  First,  pathways  to 
human  receptors  are  not  addressed  in  this  discussion;  the  issue  is  important  but  beyond 
the  scope  of  this  effort.  Second,  the  examples  provided  are  nothing  more  or  less  than 
examples;  any  application  of  these  principles  must  provide  comprehensive  analysis  of  the 
appropriate  parameters  for  the  system  in  question. 

5.3.2. 7  Stressors 

Placement  of  dredged  material  at  an  aquatic  site  can  involve  physical  and/or  chemical 
stressors  that  can  act  in  the  short  or  long  term,  either  near  or  far  from  the  site.  Physical 
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stressors  are  common  to  all  aquatic  placement  of  dredged  material  and  can  include  such 
things  as  the  initial  discharge  that  can  bury  the  benthos  and  may  physically  alter  the  sub¬ 
strate  at  the  site,  thereby  affecting  recolonization.  Dredged  materials  may  or  may  not 
contain  chemical  stressors,  which  can  be  present  singly  or  as  a  wide  variety  of  chemical 
types  and  forms,  depending  on  the  source  of  the  dredged  material  and  the  sources  of 
contamination  to  it.  These  chemical  stressors  include  not  only  the  chemicals  that  are 
measured  but  also  those  that  are  unmeasured.  Major  kinds  of  stressors  that  may  or  may 
not  be  associated  with  any  particular  dredged  material  are  summarized  with  their  most 
likely  field  of  influence  in  Table  5-1  and  discussed  in  the  context  of  the  conceptual  site 
model  in  Section  5.3.2.4.  Physical  stressors  can  be  important  in  some  circumstances,  but 
for  the  sake  of  brevity,  the  remainder  of  this  discussion  is  focused  on  chemical  stressors. 

5.3.2.2  Bcologieal  receptors  and  pathways 

Multi-user  aquatic  placement  sites,  which  are  the  focus  of  this  discussion,  can  be  found 
on  all  sorts  of  bottom  types  in  fresh  or  salt  water  at  depths  of  a  few  meters  to  a  few  thou¬ 
sand  meters,  with  wind  and/or  current  energies  causing  negligible  to  rapid  sediment 
erosion  and  transport.  Across  such  a  variety  of  environmental  conditions,  few  specifics 
can  be  offered  about  the  exposure  pathways  or  potential  receptors  that  might  be  of  con¬ 
cern  for  any  particular  dredging  project.  However,  some  general  concepts  are  provided  to 
guide  the  identification  of  pathways  and  receptors  for  consideration  on  individual 
projects. 

Ecological  risk  assessment  of  aquatic  placement  of  dredged  material  will  often  focus  on 
four  primary  categories  of  receptors,  whose  relevance  and  importance  will  differ  depend¬ 
ing  on  the  circumstances  of  each  operation.  These  primary  receptor  groups  are  1)  the 
infaunal  benthic  community,  2)  the  epibenthic  community,  3)  the  demersal  fish  commu¬ 
nity,  and  4)  the  pelagic  fish  and  invertebrate  communities.  A  fifth  group,  birds  and  wild¬ 
life  that  rely  on  fish  and  invertebrates  as  prey,  could  also  be  potentially  affected  in 
situations  where  a  complete  exposure  route  from  the  dredged  material  exists  and,  in  such 
cases,  should  be  considered  where  food  chain  transfer  of  contaminants  to  these  receptors 
is  a  potential  concern. 

The  selection  of  receptors  follows  directly  from  an  understanding  of  the  exposure  path¬ 
ways  for  contaminants  from  dredged  material  to  the  environment  (Figure  5-2).  The  pri¬ 
mary  pathways  of  concern  at  the  placement  site  (/>.,  below  the  dashed  line  in  Figure  5-2) 
are  these: 

1)  Entrainment  and  dispersion  as  the  material  settles  to  the  bottom.  Dissolved  and 
particulate  stressors  are  directly  exposed  to  the  water  column  communities, 
leading  to  identification  of  the  demersal  and  pelagic  fish  and  invertebrate 
communities  as  potential  receptors. 

2)  Direct  exposure  from  the  mound  of  dredged  material  deposited  on  the  bottom. 
Sediments  are  directly  ingested  by  species  within  the  epibenthic  and  benthic 
infaunal  communities,  leading  to  identification  of  these  communities  as 
potential  receptors.  Exposure  can  also  occur  due  to  dermal  contact  with 
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Table  5-1  Summary  of  major  types  of  potential  stressors  that  may  exist  at  an  aquatic  dredged  material 
placement  site 


Area  of  potential  influence 

Time  of  potential  influence 

Type  of  stressor 

Near  field 

Far  field 

Short  term 

Long  term 

Physical 

Burial 

X 

X 

Alteration  of 
substrate  type 

primary, 

during  placement 

secondary, 
due  to  subsequent 
sediment  transport 

primary, 

during  placement 

secondary, 
due  to  subsequent 
sediment  transport 

Chemical 

Dissolved 

contaminants 

X 

primary, 

during  placement 

secondary, 
due  to  desorption 

Sediment-sorbed 

contaminants 

X 

X 

Contaminants 
entering  food  web 

X 

X 

X 

sediments  or  contact  with  pore  water,  again  potentially  affecting  the  epifaunal 
and  infaunal  benthic  communities. 

3)  Movement  from  the  deposited  mound.  Sediments  may  move  from  their  original 
point  of  deposition  through  resuspension,  transport,  and  redeposition.  These 
sediment  movements  could  increase  the  intensity  of  exposure  and/or  the  area 
over  which  exposure  occurs  for  the  benthic  and  epibenthic  communities. 
Subsequent  disposal  at  multi-user  sites  can  substantially  alter  the  intensity  and/ 
or  spatial  nature  of  exposure  to  one  dredged  material  by  displacing  and/or 
covering  it  entirely  or  in  part  with  other  dredged  material. 

4)  Trophic  transfer.  Uptake  into  the  food  web  and  transfer  to  higher  trophic  levels 
is  a  possible  exposure  pathway  for  all  of  the  above  receptor  communities.  This 
pathway  might  warrant  particular  consideration  with  DDT,  DDE,  PCB,  tox- 
aphene,  arsenic,  methyl  mercury,  and  total  mercury,  which  have  been  shown  to 
have  the  potential  to  biomagnify  through  aquatic  food  webs  (Suedel  et  al  1994). 

5.3.2.3  Assessment  endpoints  and  measurement  endpoints 

Assessment  endpoints  must  be  developed  for  each  individual  site  in  keeping  with  the 
characteristics  of  the  site  and  the  potential  receptors  that  might  be  present.  This  discus¬ 
sion  is  intended  to  illustrate  the  nature  of  analysis  entailed  in  reaching  a  site-specific  list 
of  assessment  endpoints. 

Under  the  structure  suggested  by  Suter  (1993),  establishing  assessment  endpoints  is  the 
first  of  several  steps  that  translate  general  policy  objectives,  such  as  those  found  in  the 
international  treaties  or  national  laws,  regulations,  and  policies,  into  measurement  end- 
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Bioaccumulation  and  Trophic  Transfer 


Figure  5-2  Possible  exposure  pathways  during  dredging  and  dredged  material  placement  at  an 
aquatic  site 


points  that  can  be  evaluated  in  risk  assessment  framework.  Suter  s  (1993)  intermediate 
step  between  assessment  endpoint  and  measurement  endpoint  was  to  select  indicators  of 
effects  {e.g.,  changes  in  abundance).  A  slightly  more  elaborate  series  of  steps  is  used  here 
to  select  assessment  and  measurement  endpoints.  The  first  step  is  to  identify  the  recep¬ 
tors  {i.e.,  ecological  elements)  of  concern.  For  example,  a  species  such  as  flounder  may  be 
the  basis  of  an  important  fishery  or  food  for  other  valued  species.  The  second  step  is  to 
identify  the  receptor  attribute  that  might  be  affected.  For  example,  the  abundance  of 
flounder  might  be  affected  by  physical  and/or  chemical  stressors  associated  with  place¬ 
ment  of  the  dredged  material  at  the  site.  The  third  step  is  to  identify  a  quantitative  assess¬ 
ment  endpoint  that  reflects  the  importance  of  the  receptor  and  specifies  both  the 
attribute  of  interest  and  the  level  of  effect.  In  addition,  a  time  component  is  often  impor¬ 
tant  because  dredged  material  effects  may  be  of  finite  duration.  In  the  flounder  example. 
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an  assessment  endpoint  might  be  that  the  abundance  of  flounder  in  the  immediate  vicin¬ 
ity  of  the  site  be  within  20%  of  the  abundance  in  a  nearby  reference  area  within  one  year 
after  the  operation  is  completed. 

Assessment  endpoints  are  the  basis  for  establishing  measurement  endpoints  that  allow 
assessment  of  the  risk  to  the  receptor  from  placement  of  the  dredged  material  at  the  site. 
Measurement  endpoints  must  consider  the  potential  exposure  pathways  from  the 
dredged  material  to  the  receptors  of  concern.  In  the  case  of  the  flounder,  measurement 
endpoints  might  include  toxicity  of  the  dredged  material  to  juvenile  flounder  or  to  am- 
phipods^  polychaetes,  or  other  surrogates  for  important  flounder  food  in  the  site  vicinity. 
Such  endpoints  can  be  measured  in  laboratory  toxicity  tests,  which  if  appropriate  could 
be  designed  to  measure  other  endpoints  related  to  sublethal  effects  of  the  dredged  mate¬ 
rial  on  growth  and  reproduction,  or  even  the  potential  for  bioaccumulation  and  trophic 
transfer  through  prey  to  flounder.  The  latter  could  also  be  “measured”  using  verified  and 
calibrated  mathematical  models. 

Measurement  endpoints  provide  information  needed  to  evaluate  assessment  endpoints 
and  might  be  considered  surrogate  measures  or  indicators  of  the  latter.  In  this  regard, 
measurement  endpoints  relate  to  ecological  effects  (USEPA 1992).  The  rationale  for  iden¬ 
tification  and  selection  of  measurement  endpoints  includes  their  relevance  to  the  assess¬ 
ment  endpoints,  the  practicality  of  approaches  for  their  quantification,  their  sensitivity 
and  response  characteristics,  and  their  consistency  with  assessment  endpoint  exposure 
scenarios  (USEPA  1992).  Research  in  the  field  of  sediment  assessment  has  identified  and 
evaluated  several  tools  that  are  potentially  useful  as  measurement  endpoints  in  risk  as¬ 
sessment  of  dredged  sediment.  Although  not  intended  to  be  an  exhaustive  list,  several 
classes  of  these  are  identified  in  Table  5-2. 

The  utility  of  any  of  these  within  a  particular  assessment  should  be  evaluated  against  the 
rationale  listed  above.  For  example,  measurement  endpoints  appropriate  for  the  assess¬ 
ment  endpoints  addressing  declines  in  flounder  abundance  include  sediment  and  water 
quality  criteria  and  guidelines;  toxicity  of  deposited  and  suspended  dredged  material  to 
life  stages  of  flounder,  surrogate  species,  and  infaunal  prey  species;  and  perhaps  sum¬ 
mary  statistics  derived  from  an  appropriately  developed  model  of  flounder  population 
dynamics.  Some  of  these  (e.g.,  toxicity  to  life  stages  of  flounder)  relate  to  the  direct  effects 
of  chemical  stressors  associated  with  the  dredged  material,  whereas  others  (e.g.,  toxicity 
to  infaunal  species)  are  appropriate  in  evaluations  of  indirect  effects.  Each  varies  with 
respect  to  its  degree  of  association  to  the  assessment  endpoint  and  therefore  with  respect 
to  the  level  of  uncertainty  associated  with  its  extrapolation. 

The  process  of  identifying  the  important  ecosystem  characteristics  concerning  a  particu¬ 
lar  dredged  material  placement  operation,  determining  the  appropriate  corresponding 
receptors  of  concern,  establishing  assessment  endpoints,  and  determining  the  appropri¬ 
ate  measurement  endpoints  should  be  applied  for  each  ecosystem  characteristic  of  con¬ 
cern  for  a  particular  project  until  assessment  and  measurement  endpoints  have  been 
established  for  all  ecosystem  characteristics  of  concern. 


SET  AC  Press 


56 


ECOLOGICAL  RISK  ASSESSMENT  OF  CONTAMINATED  SEDIMENTS 


Table  5-2  Example  assessment  endpoint  and  relevant  measurement  endpoints  and  exposure  measures 
that  might  be  useful  in  ecological  risk  assessment  of  navigational  dredging 


Assessment  endpoint 

Measurement  endpoints 

Exposure  measures 

Ecologically  important  decline 
in  flounder  abundance 

Sediment  quality  criteria/guidelines 

Dredged  sediment  volume 

Water  quality  criteria/guidelines 

Dredged  and  site  sediment 
geotechnical  attributes  {e.g. , 
granulometry,  porosity,  specific 
gravity) 

Endpoints  from  acute  and  chronic 
toxicity  tests  of  whole  sediment, 
pore  water,  and  elutriate  water 

Dredged  material  and  site 
sediment  bulk,  bioavailability, 
and  elutriate  chemistry 

Abundance  of  prey  species 

Dredged  and  site  sediment 
geochemistry  (AVS,  TOC,  pH, 
ammonia) 

Inputs  to  ecological  models 

Modeled  and  measured 
contaminant  concentrations  in 
prey  species 

Demographic  and  population 
statistics  of  unimpacted  (reference) 
populations 

S.3,2,4  Conceptual  model 

The  final  step  of  problem  formulation  involves  development  of  the  conceptual  model. 
The  purpose  of  this  model  is  to  summarize  available  information  concerning  stressors, 
ecological  receptors,  and  assessment  and  measurement  endpoints  important  to  the  as¬ 
sessment.  Developed  as  a  set  of  working  hypotheses,  this  model  describes  the  environ¬ 
mental  processes  and  exposure  pathways  relating  stressors  to  receptors,  identifies  points 
along  those  pathways  where  important  exposure  information  should  be  obtained,  and 
indicates  possible  mechanisms  of  impact  to  ecological  systems.  The  conceptual  model 
also  identifies  the  spatial  and  temporal  scales  and  boundaries  of  the  assessment,  and  it 
identifies  and  relates  measurement  endpoints  to  the  assessment  endpoints.  This  is  the 
appropriate  time  to  select  the  risk  characterization  approach  to  be  used,  as  discussed  in 
Section  5.3.5. 

A  generalized  conceptual  model  that  addresses  several  spatial  and  temporal  scales  can  be 
developed.  At  the  broadest  scale  is  a  description  of  potential  exposure  to  ecological  recep¬ 
tors  during  all  phases  of  dredging  and  dredged  material  placement  (Figure  5-2).  As  de¬ 
scribed  previously,  potential  release  of  dredged  material  and  possible  associated 
contaminants  can  occur  at  several  points  along  a  sequence  from  the  dredging  operation 
to  the  dredged  material  mound  deposited  at  the  site.  Since  these  “releases”  occur  in  dif¬ 
ferent  parts  of  the  water  body  and  on  different  temporal  scales,  the  ecological  risks  asso- 
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dated  with  each  will  vary  with  respect  to  stressors,  ecological  receptors,  and  routes  of 
exposure.  In  keeping  with  the  previously  established  scope  of  this  paper,  the  remaining 
discussion  of  conceptual  models  will  focus  on  the  ecological  risks  of  a  specific  dredged 
material  placed  at  a  multi-user  aquatic  site  (illustrated  below  the  dashed  line  in  Figure  5- 
2).  The  spatial  boundaries  of  the  assessment  therefore  include  the  site  itself,  as  well  as 
areas  surrounding  the  site,  which  might  be  influenced  by  resuspension  and  subsequent 
transport  from  the  mound. 

Scenarios  of  exposure  at  the  site  include  the  possible  entrainment  of  dredged  material 
into  the  water  column  during  convective  descent  and  settlement  following  release  from 
the  transport  vessels,  exposure  to  deposited  material  on  and  around  the  mound,  and 
resuspension  and  subsequent  transport  of  dredged  material  particles  from  the  mound. 
At  a  smaller  scale,  the  dynamics  of  particles  and  chemical  stressors  become  important. 
Contaminants  may  partition  dynamically  between  sorbed  and  dissolved  states  and  may 
be  available  for  uptake  by  organisms  in  either  state.  Similar  behavior  occurs  in  the  depos¬ 
ited  sediment,  where  chemicals  may  partition  between  particle  surfaces  and  pore  water. 
These  processes  affect  the  transport  and  fate  of  chemical  contaminants  and  thus  the  ex¬ 
posure  of  receptors. 

The  aquatic  environment  into  which  dredged  material  is  placed  can  be  partitioned  into 
several  compartments,  based  on  biological,  chemical,  and  physical  properties.  Included 
are  the  benthic  and  pelagic  or  water  column  compartments  as  well  as  the  epibenthic 
compartment  (Figure  5-3).  The  benthic  compartment  can  be  considered  to  consist  of  the 
newly  created  mound  and  the  original  or  relic  underlying  sediments  which  existed  at  the 
site  prior  to  dredged  material  placement.  Exposure  dynamics  and  potential  ecological 
effects  are  different  in  all  compartments  and  are  functions  of  the  physical,  geochemical, 
and  biological  processes  occurring  in  each.  The  conceptual  model  should  recognize  the 
existence  of  these  various  compartments  and  processes  and  should  evaluate  potential 
ecological  risks  within  each  as  appropriate. 

The  conceptual  model  reflects  explicit  exposure  pathways,  from  the  source  and  nature  of 
the  environmental  stressor  (such  as  chemical  contaminants  in  the  dredged  material)  to 
the  ecological  receptors  about  which  assessment  endpoints  are  developed.  Borrowed 
from  classical  exposure  pathway  analysis,  this  level  of  representation  of  the  conceptual 
model  allows  not  only  for  explicit  visualization  of  routes  of  exposure  but  also  helps  to 
identify  key  exposure  points  (loci  along  the  exposure  pathway  which,  through  direct  or 
indirect  measurement,  permit  quantification  of  exposure  experienced  by  the  receptors  of 
concern)  for  exposure  characterization  activities.  Exposure  pathways  for  the  flounder 
population  abundance  assessment  endpoint  are  shown  in  Figure  5-4.  Clearly,  there  are 
several  possible  pathways  by  which  flounder  could  be  exposed  to  stressors  associated 
with  dredged  material.  Included  are  both  direct  effects  of  contaminant  exposure  through 
direct  contact  and  trophic  transfer  and  indirect  effects  associated  with  possible  reduc¬ 
tions  in  prey  availability  resulting  from  direct  stressor  effects  on  those  prey.  Burial  as  a 
stress  is  represented  in  Figure  5-4,  but  burial  of  flounder,  while  possible,  is  deemed  to  be 
relatively  rare  and  therefore  is  not  considered  further  in  this  discussion.  However,  both 
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Figure  5-3  Compartments  of  the  aquatic  environment  to  be  considered  in  dredged  material  risk 
assessment 


Processes  and  Ecological 

Intermediate  Sources  Receptor 


Figure  5-4  Major  exposure  pathways  that  might  be  relevant  to  a  flounder  population  abundance 
assessment  endpoint 
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the  direct  effects  of  flounder  burial  and  the  indirect  effects  of  reductions  in  prey  availabil¬ 
ity  due  to  burial  should  be  considered  in  actual  dredged  material  evaluations. 

Key  exposure  points,  and  therefore  media  for  exposure  measurement  for  the  flounder 
assessment  endpoint,  include  the  deposited  dredged  material,  water  influenced  by  con¬ 
taminants  dissolved  from  the  dredged  material,  and  flounder  prey.  In  the  prospective 
view  characteristic  of  dredged  material  risk  assessments,  exposure  measures  may  be  de¬ 
rived  from  physical  or  mathematical  simulations  of  exposure  conditions  (involving  such 
processes  as  sediment  deposition  and  entrainment,  hydrodynamic  transport,  and  bioac¬ 
cumulation  and  trophic  transfer)  or  may  be  based  upon  data  collected  in  field  investiga¬ 
tions.  Such  exposure  measurements  are  subject  to  validation  and  are  associated  with 
some  degree  of  uncertainty.  This  uncertainty  should  be  quantified  to  the  extent  possible 
to  aid  in  interpretation  of  risk.  Several  classes  of  exposure  measures  proven  to  be  useful 
tools  are  presented  as  examples  in  Table  5-2.  Selecting  appropriate  exposure  measures 
involves  evaluating  their  ability  to  reflect  characteristics  of  exposure  along  a  pathway 
from  stressor  source  to  ecological  receptor.  As  importantly,  they  should  provide  informa¬ 
tion  useful  for  predicting  and  interpreting  changes  in  measurement  endpoints.  Examples 
of  valuable  exposure  measures  for  the  flounder  assessment  endpoint  include  the  volume 
and  spatial  distribution  of  the  dredged  material  deposits  and  the  physical  and  chemical 
characteristics  of  the  dredged  material. 

The  conceptual  model  also  considers  the  potential  ecological  effects  expected  to  result 
from  stressor  presence  and  subsequent  exposure.  As  identified  earlier,  such  effects  de¬ 
pend  on  the  nature  of  the  stressors  as  well  as  on  the  kinds  and  numbers  of  ecological  re¬ 
ceptors  present  within  the  influence  of  the  dredged  material.  Potential  ecological  effects 
are  summarized  in  the  conceptual  model  as  a  function  of  important  assessment  end¬ 
points  and  pathways  of  exposure  from  source  to  receptors.  Ecological  effects  hypoth¬ 
esized  for  the  flounder  assessment  endpoint  include  direct  toxicological  impact  to 
individual  flounder  (perhaps  ranging  from  subtle  physiological  and  cytogenetic  effects  to 
gross  reproductive  and  survival  impacts)  as  a  result  of  contact  with  contaminants  in  the 
sediment,  direct  impact  due  to  burial  of  flounder  (not  followed  in  this  discussion),  as  well 
as  impacts  on  flounder  population  dynamics  resulting  from  changes  in  demographics. 
Direct  toxicological  effects  also  might  occur  as  a  result  of  the  transfer  of  chemical  con¬ 
taminants  to  flounder  through  trophic  pathways.  Indirect  effects  resulting  from  reduc¬ 
tion  in  the  availability  of  prey  through  toxicological  impact  and  burial  might  also  be 
considered.  An  understanding  of  the  potential  effects  of  stressor  exposure  on  ecological 
receptors  supports  identification  of  measurement  endpoints  suitable  to  evaluate  assess¬ 
ment  endpoints.  The  conceptual  model  should  explicitly  define  the  activities  to  be  con¬ 
ducted  in  the  effects  assessment,  exposure  assessment,  and  risk  characterization 
components  of  the  risk  assessment.  These  activities  derive  directly  from  problem  formu¬ 
lation  and  the  fully  developed  conceptual  model.  Clearly,  the  product  of  problem  formu¬ 
lation  depends  upon  the  aspect  of  navigation  dredging  being  evaluated,  that  is,  the 
component  along  the  sequence  from  dredging  to  placement  illustrated  in  Figure  5-2.  The 
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discussion  provided  above  should  aid  in  appropriate  problem  formulation  for  any  aspect 
of  navigation  dredging. 

5.3.3  Effects  assessment 

Effects  assessment  is  the  process  of  establishing  the  strength  or  magnitude  of  the  potential 
impact  dredged  material  could  have  if  organisms  were  to  be  exposed  to  it.  Impacts  may 
be  related  to  physical  stressors  {e.g.,  burial  or  change  in  physical  characteristics  like  grain 
size)  or  to  chemical  stressors  {e.g.,  dissolved  or  sorbed  contaminants).  Conduct  of  an  ef¬ 
fects  assessment  for  dredged  material  is  driven  by  1)  the  biological  receptors  of  concern 
and  2)  the  anticipated  exposure  scenarios.  That  is,  effects  are  assessed  in  the  context  of 
affected  receptors  and  expected  environmental  exposures  (see  exposure  assessment  as 
discussed  in  Section  5.3.4).  One  of  the  most  frequently  evaluated  scenarios  is  the  effect  of 
sediment-associated  chemicals  in  deposited  dredged  material  on  the  benthic  community. 
To  assess  these  effects,  toxicity  tests  of  deposited  dredged  material  are  conducted  with 
appropriately  sensitive  infaunal  or  epifaunal  species.  The  potential  for  contaminants  to 
bioaccumulate  from  deposited  sediments  is  also  usually  determined  as  part  of  the  effects 
assessment.  Although  bioaccumulation  per  se  is  a  phenomenon  and  not  an  effect,  the 
data  are  reviewed  for  potential  residue-effects  relationships.  Effects  in  the  water  column 
are  evaluated  by  conducting  acute  (48-h  to  96-h)  elutriate  toxicity  tests  with  appropriately 
sensitive  water  column  species.  Elutriate  and  deposited  sediment  toxicity  tests  evaluate 
response  to  all  contaminants  in  the  dredged  material,  both  known  and  unknown.  Most 
sediment  toxicity  tests  available  today  measure  survival  or  growth  after  short-term  expo¬ 
sures  (4  to  10  days).  A  new  generation  of  toxicity  tests  involving  longer  exposures  and 
sublethal  endpoints  {e.g.,  growth  and  reproduction)  are  being  developed  (Dillon  1993; 
also  see  Chapter  18).  As  these  chronic  sublethal  sediment  bioassays  become  available, 
they  will  likely  be  used  with  increasing  frequency  as  effects  assessment  tools. 

If  effects  on  receptors  at  higher  trophic  levels  are  of  concern,  a  chemical-by-chemical 
approach  may  be  taken.  For  example,  the  approach  can  begin  with  case-specific  bioaccu¬ 
mulation  data  and  can  model  trophic  transfer  of  individual  chemicals  from  benthic  in¬ 
fauna  to  a  predator  {e.g.,  flounder,  humans).  Effects  can  then  be  evaluated  by  examining 
residue-based  or  ingestion-based  benchmark  toxicity  taken  from  the  literature  or  gener¬ 
ated  empirically  on  a  site-specific  basis. 

To  improve  the  utility  of  effects  assessments  in  dredged  material  risk  assessments,  four 
major  issues  must  be  addressed:  1)  interpretive  guidance,  2)  extrapolations,  3)  cumula¬ 
tive  effects,  and  4)  recovery  potential.  The  first  two  concern  the  laboratory  assessments 
of  the  effects  of  dredged  material  {i.e.,  toxicity  tests)  that  are  done  today.  The  latter  two 
issues  refer  to  the  field  evaluation  of  dredged  material  impacts  at  the  placement  site.  Al¬ 
though  lab  evaluations  have  historically  been  emphasized  over  field  studies,  it  is  the  field 
impacts  that  are  of  ultimate  ecological  and  regulatory  concern  (“no  unacceptable  adverse 
impacts”). 
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5.3.3. 7  Interpretive  guidance 

Comparison  to  a  reference  sediment  is  currently  a  major  component  of  interpreting 
dredged  material  toxicity  and  bioaccumulation  test  results  in  the  U.S.  regulatory  pro¬ 
gram  for  dredged  material.  Comparison  to  reference  provides  a  relative  basis  for  inter¬ 
preting  results.  The  technical  basis  for  an  absolute  interpretation,  (z.c.,  what  does  15% 
mortality  mean  ecologically  versus  25%  mortality?)  is  more  elusive.  Absolute  interpreta¬ 
tions  become  even  more  problematic  when  sublethal  endpoints  are  examined.  Demo¬ 
graphic  models  may  form  the  basis  for  interpreting  chronic  sublethal  bioassays  with 
dredged  material.  Demographic  models  integrate  multiple  life  history  information  (e.g., 
survival,  growth,  reproduction)  and  express  toxicity  at  a  population  level  (Hummon 
1974;  Gentile  eta/.  1982;  Wong  and  Wong  1990).  This  source  of  interpretive  guidance  is 
still  firmly  embedded  in  the  research  and  development  community. 

In  some  instances,  the  basis  for  interpretive  guidance  is  a  chemical-specific  benchmark 
toxicity  value.  For  example,  trophic  transfer  modeling  may  predict  that  a  contaminant  of 
concern  associated  with  the  dredged  material  in  question  could  bioaccumulate  to  X  con¬ 
centration  in  flounder.  One  would  compare  this  predicted  tissue  concentration  with  a 
benchmark  toxicity  value.  These  values  are  occasionally  reported  in  the  scientific  litera¬ 
ture,  but  in  practice,  there  often  is  no  benchmark  toxicity  value  for  the  chemical-receptor 
combination  of  interest,  and  one  must  be  developed  by  extrapolation  among  species 
and/or  contaminants.  Interpretive  guidance  based  on  chemical-specific  toxicity  bench¬ 
marks  must  also  always  consider  the  potential  impacts  of  contaminant  interactions,  e.g., 
synergism  and  antagonism  (see  Chapters  16  and  18). 

5.3.3.2  Extrapolations 

Risk  assessment  is  basically  an  exercise  in  extrapolations.  Important  extrapolations,  re¬ 
quired  to  relate  effects  assessment  data  to  the  original  assessment  endpoints  during  risk 
characterization  (see  Section  5.3.5),  include  the  following: 

•  Species  to  species:  It  is  seldom  practical  to  test  all  species  of  concern. 

•  Response  to  response:  One  may  measure  acute  toxicity  but  also  may  be  inter¬ 
ested  in  effects  on  survival  or  reproduction. 

•  Individual  to  population:  Effects  on  individual  organisms  are  measured  in  most 
tests,  but  it  is  usually  the  population  of  organisms  that  is  of  concern. 

•  Laboratory  to  field:  Effects  under  controlled  laboratory  conditions  may  or  may 
not  mimic  effects  under  actual  exposure  conditions  at  the  placement  site. 

•  Present  to  future:  Effects  observed  today  may  or  may  not  persist  into  the  future. 

5.3.3.3  Cumulative  effects 

The  term  cumulative  ^ects  has  been  used  to  refer  to  effects  over  time  or  to  the  combined 
effects  of  multiple  stressors  (e.g,  metals  plus  pesticides).  It  has  also  been  used  to  refer  to 
the  combined  effects  of  multiple  events  (e.g,  placement  of  dredged  material  from  several 
different  projects  at  a  site  over  time).  The  term  is  used  in  the  latter  sense  in  the  U.S. 
dredging  regulations  (e.g,  40  CFR  230.11(g))  and  in  this  discussion.  The  concern  about 
cumulative  effects  focuses  not  on  project-specific  toxicity  tests  but  on  the  health  and  vi- 
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ability  of  the  benthic  communities  near  the  placement  site.  International  dredged- 
material  regulations  address  the  concept  of  cumulative  effects  of  all  activities  taking  place 
at  the  site,  and  cumulative  effects  are  important  in  complete  risk  assessment  and  risk 
management  contexts.  However,  practical  and  appropriate  ways  to  quantitatively  predict 
cumulative  effects  have  received  little  research  attention,  some  of  which  is  addressed  in 
Chapter  16.  A  way  of  including  cumulative  effects  in  dredged  material  risk  assessments, 
based  on  the  effects  assessments  of  each  material  previously  placed  at  the  site,  is  de¬ 
scribed  in  the  context  of  cumulative  exposure  at  the  site  in  Section  5.3.4. 

5.3.4  Exposure  assessment 

In  the  context  of  risk  assessment  applied  to  dredged  material  placed  at  a  multi-user  site, 
exposure  assessment  must  address  a  variety  of  conditions,  including,  for  example,  the 
following: 

•  Suspended  and  deposited  sediments 

•  Multiple  projects  of  different  size,  sediment  quality,  and  dredging  method  (and 
their  different  physical  behaviors)  placed  at  a  single  site  over  time 

•  Sedentary  and  mobile  receptors  of  concern 

•  Direct  exposure  of  biota  and  food  web  vectors  for  contaminant  transfer 

In  addition,  the  regulatory  context  in  which  many  risk  assessments  of  navigation  dredg¬ 
ing  are  conducted  imposes  other  constraints.  For  example,  in  the  U.S.,  most  navigation 
dredging  risk  assessments  will  have  to  contend  with 

•  effects  evaluated  on  the  basis  of  whole  sediment  biological  tests  as  well  as  on  the 
basis  of  chemical  analyses  and 

•  compatibility  with  a  regulatory  context  in  which  permit  decisions  must  be  made 
expeditiously. 

5,3.4.1  Exposures  to  be  considered 

The  following  discussion  of  exposure  assessment  at  a  multi-user  site  focuses  on  the  tem¬ 
poral  and  spatial  distribution  of  the  stressors  of  interest  (i.e.,  contaminants  associated 
with  suspended  or  deposited  sediment).  Also  important  in  all  cases  but  not  addressed 
here  is  the  temporal  and  spatial  distribution  of  the  organisms  or  receptors  of  concern  in 
relation  to  the  distribution  of  the  stressor.  It  is  important  to  know  which  species  or  recep¬ 
tors  of  concern  are  likely  to  encounter  the  stressors,  the  frequency  and  duration  of  con¬ 
tact,  the  proportion  of  the  receptor  population  encountering  the  stressor,  and  other 
relevant  ecological  considerations  in  addition  to  the  physical  considerations  described 
below. 

During  placement.  During  dredged  material  placement  at  the  site,  suspended  sedi¬ 
ment  is  introduced  throughout  the  vertical  extent  of  the  water  column  as  the  dredged 
material  is  discharged  from  the  transportation  vessel  or  pipeline.  This  continues,  often 
on  a  frequent  intermittent  basis,  until  the  dredging  project  being  evaluated  is  complete, 
then  ceases  until  the  next  dredging  project  begins.  When  dredged  material  is  released 
near  the  water  surface,  most  of  the  material  descends  rapidly  to  the  bottom  in  a  mass 
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descent  phenomenon.  However,  a  few  percent  of  the  total  mass  of  sediment  discharged 
may  remain  in  suspension  long  enough  for  short-term  dispersion  in  the  water  column, 
providing  a  potential  exposure  mechanism  for  water  column  organisms. 

Exposure  to  deposited  sediment  during  placement  is  a  theoretical  possibility  but  seldom 
appears  to  be  a  major  focus  of  concern  for  contaminants.  Organisms  directly  buried  by 
the  deposited  material  may  face  major  physical  stresses,  but  the  burial  usually  overshad¬ 
ows  any  chemical  concerns  on  a  short-term  basis.  Because  new  sediment  is  more  or  less 
continuously  being  deposited  throughout  the  placement  operation,  site  recolonization 
does  not  begin  in  earnest  until  after  placement  is  complete.  Thus,  there  are  limited 
benthos  present  to  be  exposed  to  deposited  sediment  until  after  placement  is  completed. 

Post  placement.  After  the  discharge  operation  is  completed,  sediment  may  be  resus¬ 
pended  from  the  site  by  wave  and/or  current  action.  This  may  be  a  mechanism  of  expo¬ 
sure  for  organisms  on  and  off  the  site,  depending  on  the  distribution  of  the  suspended 
sediment.  After  the  project  is  completed,  recolonization  of  the  deposited  dredged  mate¬ 
rial  by  larvae  and  mobile  organisms  may  occur.  The  deposited  material  probably  pro¬ 
vides  the  most  direct  exposure  opportunity  for  infauna  and  epifauna  and  for  the 
organisms  that  feed  on  them. 

5.3.4.2  Exposure  assessment  approach 

The  following  approach  to  exposure  assessment  is  presented  at  the  conceptual  level. 
Clearly,  many  aspects  remain  to  be  worked  out  on  a  case-by-case  basis,  and  some  will 
require  further  research  and  development  before  routine  implementation  becomes  prac¬ 
tical.  However,  the  conceptual  approach  is  appropriate  and  provides  a  framework  to 
guide  development  of  practical  implementation  procedures. 

During  ptacement.  The  USEPA  and  USAGE  procedure  (1991)  for  evaluation  of  water 
column  impacts  was  designed  specifically  to  address  exposure  to  suspended  sediments  at 
the  site  during  discharge.  This  approach  uses  a  mathematical  model  developed  specifi¬ 
cally  for  this  purpose  to  describe  the  temporal  and  spatial  distribution  of  suspended  sedi¬ 
ments  and  associated  chemicals  during  and  immediately  following  a  discharge  event.  The 
model  can  describe  concentrations  of  dissolved  contaminants  and  thus  is  compatible 
with  chemical-based  effects  endpoints  for  effects  assessment  purposes.  It  can  also  de¬ 
scribe  suspended  sediment  concentrations  and  so  can  be  used  with  toxicity  or  other  ef¬ 
fects  tests  based  on  exposure  to  suspended  sediment. 

Post  placement.  Resuspension  of  deposited  dredged  material  after  discharge  is  com¬ 
pleted  can  be  modeled  mathematically  or  estimated  by  other  means.  Once  the  temporal 
and  spatial  distribution  of  suspended  sediments  or  associated  chemicals  is  known,  these 
distributions  can  be  used  with  either  chemical-  or  biological-based  measurement  end¬ 
points  in  effects  assessment. 

Long-term  exposure  to  the  deposited  dredged  material  mound  depends  less  on  move¬ 
ment  of  the  sediment  than  on  movement  of  receptors  onto  the  sediment.  This  can  occur 
either  through  larval  setting  or  movement  of  adults,  including  feeding  on  infauna  at  the 
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site.  Exposure  to  deposited  sediment  can  be  evaluated  in  relation  to  either  biological  or 
chemical  effects  endpoints.  Exposure  to  deposited  sediment  is  controlled  predominately 
by  biotic  and/or  physical-chemical,  rather  than  physical,  factors. 

5.3.4.3  Exposure  ca/cu/af/on  concept 

In  the  simple  case,  material  from  only  one  dredging  project  will  be  placed  at  the  site. 
However,  at  multi-user  sites,  the  complexity  of  dredged  material  from  multiple  projects 
placed  at  the  site  over  time  must  be  addressed.  Exposure  to  deposited  sediment  in  rela¬ 
tion  to  either  biological  or  chemical  effects  measurement  endpoints  is  controlled  largely 
by  movement  of  receptors  onto  the  deposit.  Exposure  is  assessed  considering  the  size  of 
the  deposit,  frequency  and  duration  of  receptor  contact  with  the  sediment  stressors,  pro¬ 
portion  of  the  receptor  populations  exposed,  etc.  When  a  second  dredging  project  is 
placed  at  the  same  site,  exposure  to  the  new  deposit  is  evaluated  in  the  same  way.  How¬ 
ever,  total  exposure  is  now  the  cumulative  exposure  to  the  first  project  mound  plus  the 
exposure  to  the  second  project  mound.  If  the  second  project  mound  partially  or  entirely 
covers  the  first  mound,  the  reduction  in  surface  area  of  the  first  mound  accessible  to  re¬ 
ceptors  must  be  considered  in  calculating  the  cumulative  exposure  to  receptors  as  a  result 
of  the  second  project.  The  same  process  is  applied  to  calculating  cumulative  exposure  as 
a  result  of  placement  of  subsequent  mounds. 

For  multi-user  sites,  cumulative  exposure  is  evaluated  as  shown  in  the  following  concep¬ 
tual  illustration  (based  on  Figure  5-5).  The  approach  is  the  same  whether  exposure  is 
evaluated  in  relation  to  sediment  chemistry  or  in  relation  to  sediment  toxicity  or  other 
biological  effects  endpoints.  When  placement  of  the  first  project  is  complete,  exposure  of 
receptors  at  the  site,  all  other  things  being  equal,  is  a  function  “f  ”  of  the  surface  area  of 
“mound”  1.  This  “f”  involves  aspects  of  chemical  bioavailability  and  co-occurrence  of 
receptors  at  the  site. 

Cumulative  exposure  as  a  result  of  adding  a  particular  dredged  material  to  the  placement 
site  can  be  illustrated  by  the  following  equations.  These  equations  are  not  intended  for 
mathematical  solution  but  are  intended  simply  to  illustrate  the  concept  of  quantifying 
cumulative  exposure,  assuming  additivity: 

Xi  =  f(Ai) 

where  X;i^  =  cumulative  exposure  at  the  site  after  placement  of  project  1  and 
Aj^  =  surface  area  of  mound  1. 

Project  2  is  placed  at  the  site  in  such  a  way  that  mound  2  does  not  overlap  mound  1.  Af¬ 
ter  project  2  is  completed,  the  cumulative  exposure  (X2)  of  receptors  at  the  site  will  be 

X2  =  fI(A2)-H(Ai)]. 

The  third  project  to  be  placed  at  the  site  is  predicted  to  create  a  mound  that  partially 
covers  both  mounds  1  and  2.  After  project  3  is  completed,  the  cumulative  exposure  (X3) 
of  receptors  at  the  site  will  be 

X3  =  fl(A3)  +  (A2  -  A2c)  +  (Ajl  -  Aj^c)] 
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Figure  5-5  Conceptual  illustration  of  cumulative  exposure  to  deposited  dredged  material  at  a  multi-user 
aquatic  site  as  “mounds”  of  dredged  material  from  different  projects  are  sequentially  added 


where  A2c  =  surface  area  of  mound  2  covered  by  subsequent  mounds  and 
surface  area  of  mound  1  covered  by  subsequent  mounds. 

The  fourth  project  is  placed  at  the  site  in  such  a  manner  that  it  partially  overlaps  mound 
3  and  a  portion  of  mound  1  not  covered  by  mound  3.  The  cumulative  exposure  (X^  of  re¬ 
ceptors  at  the  site  after  placement  of  the  mound  from  project  4  will  be 

X4  =  fI(A4)  +  (A3  -  +  (A2-  A2c)  +  (A^  “  Ajj.)]. 

Note  that  A^^  here  has  a  larger  value  after  the  placement  of  mound  3  because  mound  4 
covers  part  of  mound  1  that  was  not  previously  covered. 

Subsequent  projects  placed  at  the  site  would  be  addressed  by  extension  of  this  approach. 

5.3.5  Risk  characterization 

Risk  characterization  is  the  final  phase  of  the  risk  assessment  process.  Major  elements  of 
this  phase  include  integrating  exposure  and  effects  information,  summarizing  relevant 
supporting  information  in  a  weight-of-evidence  discussion,  interpreting  the  ecological 
importance  of  the  risks,  and  discussing  the  results  of  the  assessment  with  the  risk  man¬ 
ager  and  other  stakeholders.  In  this  section,  these  principles  are  discussed  with  respect  to 
the  placement  of  dredged  material  at  a  multi-user  aquatic  site.  A  hypothetical  character- 
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ization  of  potential  risks  to  flounder  abundance  in  the  vicinity  of  such  a  site  is  used  as  an 
example. 

The  approaches  used  for  characterizing  risk  associated  with  dredged  material  at  an 
aquatic  site  will  depend  on  the  purpose  of  the  assessment  as  well  as  on  the  resources  (per¬ 
sonnel  and  time)  available  for  the  assessment.  Decisions  concerning  selection  of  risk 
characterization  approaches  should  have  been  made  as  part  of  the  conceptual  model.  In 
general,  empirical  and  mechanistic  approaches  may  be  used  to  characterize  risk  (Wiegert 
and  Bartell  1994). 

Examples  of  empirical  approaches  include  joint  distributional  analysis  and  the  quotient 
method.  In  the  quotient  method  (Barnthouse  et  al  1986),  a  point  estimate  of  exposure  is 
divided  by  a  point  estimate  of  an  effect  level  or  toxicological  benchmark  concentration, 
i.e.,  by  an  effect  measurement  endpoint.  For  example,  in  a  preliminary  evaluation  of  the 
risk  to  flounder  of  chemicals  released  into  the  water  column  during  placement  of  dredged 
material,  an  estimate  of  a  chemical  concentration  might  be  divided  by  the  corresponding 
acute-toxicity  water  quality  criterion.  If  the  quotient  value  is  much  less  than  one,  a  lack  of 
potential  acute  toxicity  might  be  inferred;  if  the  quotient  value  is  much  greater  than  one, 
potentially  substantial  effects  might  be  inferred.  Intermediate  quotient  values  generally 
indicate  the  need  for  more  detailed  evaluation.  While  the  quotient  method  offers  an  ef¬ 
ficient  means  for  risk  screening,  it  allows  qualitative  rather  than  quantitative  estimates  of 
uncertainty.  In  addition,  the  quotient  method  does  not  facilitate  management  of  risks 
through  exposure  reduction,  since  the  meaning  of  a  quotient’s  drop  (from,  e.g.,  5.4  to 
2.1),  which  results  from  reduced  exposure,  cannot  be  readily  interpreted  in  terms  of  po¬ 
tential  ecological  effects. 

An  alternative  empirical  approach  to  the  quotient  method  for  integrating  risk  is  the  use 
of  joint  probability  analysis  {e.g.,  Cardwell  et  al.  1993).  In  this  approach,  distributions  of 
both  exposure  concentrations  and  species  sensitivity  values  {i.e.,  effects  measurement 
endpoints)  are  graphed.  For  example,  consider  the  pathway  to  flounder  through  contami¬ 
nated  prey.  In  this  case,  a  range  of  predicted  exposure  values  for  tissue- contaminant  lev¬ 
els  might  be  compared  to  a  distribution  of  toxicity  effects  obtained  from  a  feeding  study 
with  flounder.  The  overlap  or  lack  thereof  in  the  exposure  and  toxicity  functions  could  be 
used  to  predict  risk.  A  potential  limitation  of  this  approach  is  the  need  to  develop  or  vali¬ 
date  and  calibrate  an  acceptable  food  web  model.  However,  other  applications  of  distri¬ 
butional  methods  that  rely  upon  existing  data  have  been  described  {e.g.,  Cardwell  etal. 
1993). 

Mechanistic  (or  process)  models  use  hypotheses  about  causal  processes  to  simulate  the 
responses  of  biological  systems  to  stressors.  In  the  flounder  example,  if  the  pathway  in¬ 
volved  concerns  for  flounder  population  changes  resulting  from  reduced  prey  abun¬ 
dance,  changes  predicted  from  population  models  for  important  prey  species  such  as 
amphipods  or  worms  might  be  coupled  to  a  flounder  population  model.  While  such 
models  can  generate  probabilistic  data  and  facilitate  exploration  of  a  range  of  alternative 
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exposure  scenarios,  necessary  input  data  may  be  unavailable  and  associated  uncertainty 
levels  may  be  high. 

Whatever  method  is  used  to  integrate  exposure  and  effects  data,  the  risk  should  be  ex¬ 
pressed  in  terms  of  effects  on  an  assessment  endpoint.  For  example,  if  the  measurement 
endpoint  for  water-column  chemical  effects  on  flounder  is  toxicity  to  a  surrogate  species, 
the  risk  characterization  should  express  risk  to  the  flounder  through  an  evaluation  of  the 
extrapolation  from  the  surrogate  to  flounder. 

Major  uncertainties  associated  with  such  extrapolations  and  with  the  entire  risk  assess¬ 
ment  process  should  be  aggregated  and  evaluated  during  risk  characterization.  If  a  quo¬ 
tient  method  is  used,  only  a  qualitative  estimate  of  uncertainty  may  be  possible.  On  the 
other  hand,  if  mechanistic  models  were  used,  quantitative  uncertainty  estimates  may  be 
available.  Other  major  uncertainties  associated  with  risk  assessment  should  be  identified 
as  well.  Examples  include  uncertainties  associated  with  a  range  of  extrapolations  (from 
toxicity  to  surrogate  species  to  toxicity  to  flounder;  from  laboratory  bioassays  with  whole 
sediments  to  field  effects)  and  with  assumptions  regarding  exposure  routes.  For  example, 
when  determining  the  risk  to  flounder  feeding  in  the  vicinity  of  the  site,  there  may  be 
considerable  uncertainty  as  to  the  proportion  of  prey  consumed  that  is  contaminated 
from  the  site,  relative  to  the  proportion  that  is  unaffected  by  the  dredged  material. 

Information  bearing  on  the  risk  assessment  may  be  obtained  from  sources  other  than  the 
analyses  used  in  the  risk  integration.  Such  information  should  be  summarized  and  evalu¬ 
ated  as  part  of  a  weight-of-evidence  discussion  (USEPA  1992).  Using  multiple  lines  of 
evidence  can  be  quite  valuable,  especially  when  the  individual  elements  are  not  conclu¬ 
sive  (USEPA  1994).  In  the  example  of  evaluating  risks  to  flounder  from  consuming  con¬ 
taminated  prey,  flounder  food  web  models  might  be  evaluated  in  conjunction  with  any 
applicable  monitoring  data  on  flounder  or  prey  populations  in  the  vicinity  of  the  site. 

Once  the  risks  and  uncertainties  have  been  summarized,  the  ecological  importance  of  the 
risk  should  be  evaluated.  While  ecological  importance  has  many  facets  (Harwell  et  al 
1994),  there  are  several  aspects  particularly  relevant  to  dredged  material.  Consideration 
of  the  temporal  scale  may  be  important.  For  example,  rate  of  recolonization  of  benthic  in¬ 
vertebrates  at  the  site  should  be  considered.  Even  if  the  risk  manager  is  not  concerned 
with  dredged  material  effects  on  benthic  invertebrates  within  the  site,  the  rate  of  recolo¬ 
nization  of  the  newly  deposited  sediments  is  one  factor  that  influences  how  and  when 
flounder  feeding  in  the  area  may  begin  to  encounter  contaminated  prey. 

Temporal  scale  is  also  important  if  multiple  dredging  projects  are  to  be  placed  at  a  par¬ 
ticular  site.  In  this  case,  the  risk  assessment  should  consider  changes  in  exposure  result¬ 
ing  from  burial  and  reburial  of  previously  applied  sediments  (Section  5.3.4.3).  Changes 
in  benthic  invertebrate  populations  (as  prey  for  flounder)  that  result  from  repeated  dis¬ 
turbances  should  also  be  considered. 

Spatial  scale  is  another  aspect  of  ecological  importance  for  placement  of  dredged  mate¬ 
rial  at  aquatic  sites.  The  areal  extent  of  bottom  covered  by  deposits  is  obviously  impor- 
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tant  in  evaluating  the  severity  of  the  effect  on  benthic  invertebrates.  If  the  concern  is  for 
flounder,  the  foraging  area  of  the  fish  needs  to  be  considered  relative  to  the  area  of  the 
site.  Spatial  scale  is  also  critical  in  determining  the  nature  and  extent  of  effects  on  floun¬ 
der.  A  severe  effect  on  individual  flounder  in  the  immediate  vicinity  of  the  site  may  be  of 
negligible  consequence  to  the  flounder  population  in  the  area. 

When  the  risk  assessment  has  been  completed,  the  results  are  discussed  with  the  risk 
manager.  Wiegert  and  Bartell  (1994)  summarize  a  number  of  factors  that  can  lead  to  ef¬ 
fective  presentations  of  risk  assessment  findings.  Some  important  areas  are  highlighted 
below: 

•  Present  results  in  terms  of  the  assessment  endpoints  that  were  chosen  in 
conjunction  with  the  risk  manager  in  the  problem  formulation  stage.  This  will 
help  ensure  that  the  risk  assessment  is  recognized  as  relevant  to  the  needs  of  the 
risk  manager.  If  the  assessment  endpoint  is  flounder  abundance,  do  not  present 
risk  in  terms  of  a  surrogate  test  species. 

•  Where  appropriate,  describe  a  range  of  risks.  If  dredged  material  is  placed  at  a 
site  with  little  lateral  dispersion,  the  risk  to  benthic  invertebrates  associated 
with  that  material  may  be  relatively  uniform  across  the  site.  However,  if  there  is 
substantial  spreading  of  the  sediment  and  mixing  with  adjacent  sediments,  it 
may  be  appropriate  to  provide  a  range  of  risk  estimates  associated  with  the 
range  of  exposure  conditions  expected  on  different  sub-areas  of  the  site. 

•  Summarize  major  assumptions  and  uncertainties.  While  it  is  neither  necessary 
nor  desirable  to  list  every  possible  uncertainty,  the  risk  manager  should  clearly 
understand  major  limitations,  data  gaps,  assumptions,  and  uncertainties. 
Examples  might  include  1)  exposure/effects  pathways  not  considered  in  the 
assessment,  such  as  direct  burial  of  disposal  site  flounder  or  benthic  inverte¬ 
brates  by  sediments,  or  2)  extrapolations  from  toxicity  tests  with  a  few  inverte¬ 
brate  species  to  acute  and  chronic  effects  on  the  benthic  invertebrate 
community. 

It  may  also  be  appropriate  to  describe  potential  follow-on  activities,  such  as  monitoring, 
to  evaluate  the  predictions  of  the  risk  assessment. 

Recovery  potential  has  been  essentially  unaddressed  by  either  research  and  development 
or  U.S.  regulatory  policy.  This  is  a  critical  gap  since  it  is  often  assumed  that  when  stres¬ 
sor  input  stops,  the  ecosystem  will  recover  in  a  normal  and  acceptable  pattern  and  rate. 
This  may  or  may  not  be  the  case  (Chapter  11).  Only  well-designed  monitoring  programs 
based  on  testable  hypotheses  will  be  able  to  assess  recovery  potential  and  provide  a  ba¬ 
sis  for  quantitatively  incorporating  consideration  of  recovery  into  future  risk  assess¬ 
ments. 


5.4  Research  needs 

A  number  of  topics  that  are  important  to  the  application  of  risk  assessment  to  navigation 
dredging  could  benefit  greatly  from  additional,  carefully  focused  research,  development, 


SETAC  Press 


Session  3;  Navigational  Dredging,  Chapters 


69 


and  refinement.  Some  of  the  more  urgent  or  promising  of  these  are  identified  briefly 
below: 

•  Chronic  toxicity  effects  endpoint  measurement  techniques  using  dissolved  and 
sediment-sorbed  contaminants 

•  Shortcut  toxicity  tests,  perhaps  using  concentration  techniques,  as  an  alterna¬ 
tive  for  chronic,  long-term  tests 

•  Exposure-time  relationships  in  chronic  whole-sediment  toxicity  tests  analogous 
to  exposure-dilution  relationships  in  aquatic  toxicity  tests 

•  Ecological  relevance  of  bioassay  response  for  in  situ  effects,  that  is,  extrapolation 
of  laboratory  test  responses  to  field  conditions.  This  could  involve  such  issues  as 

-  application  of  toxicity  test  results  with  individual  organisms  to 
population-level  effects, 

-  application  of  tests  under  laboratory  conditions  to  effects  under  “real 
world”  exposures,  and 

-  adaptation  of  effects  test  methods  to  take  into  account  multiple 
exposures  to  different  sediments  in  the  field. 

•  Confounding  factors  in  laboratory  toxicity  tests,  such  as  effects  on  test  species  of 
ammonia,  hydrogen  sulfide,  pH  and  salt  shifts,  etc.,  in  laboratory  toxicity  tests 

•  Conceptual  framework  for  shortcut  identification  of  sediments  having  negli¬ 
gible  contaminant  risks,  perhaps  based  on  lack  of  effect  in  appropriate  tests 

•  Better  methods  and  guidance  for  quantitatively  predicting  cumulative  exposure 
at  the  placement  site 

•  Methods  for  quantitatively  predicting  recovery  potential  at  the  placement  site 

•  Cost-effective  procedures  for  measuring  effects  and  exposure  endpoints 

5.5  Conclusions  and  recommendations 

•  The  ERA  process  is  a  useful  framework  for  conceptualizing  problems,  conduct¬ 
ing  testing  and  evaluations,  and  communicating  technical  results  to  the  dredged 
material  manager  and  stakeholders. 

•  A  tiered  risk  assessment  process  is  an  efficient  approach  for  assessing  dredged 
material. 

•  In  relation  to  navigation  dredging,  assessment  endpoints  and  their  relationship 
to  measurement  endpoints  have  historically  been  poorly  developed. 

•  Actual  quantitative  exposures  at  placement  sites  have  been  poorly  documented. 

•  Chemical  and/ or  biological  measurements  by  themselves  are  insufficient  to 
support  management  decisions  about  dredged  material.  Exposure  assessment 
must  also  be  incorporated  into  a  complete  risk  assessment  before  management 
decisions  can  be  properly  made. 

•  Managing  exposure  at  the  placement  site  is  the  key  to  managing  ecological  risks 
associated  with  dredged  material. 
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•  Dredged  material  should  be  evaluated  and  managed  in  the  context  of  the 
placement  site  ecosystem.  Attention  to  appropriate  temporal  and  spatial  scales 
is  essential. 

•  Site  selection,  dredged  material  testing  and  evaluation,  site  management,  and 
monitoring  should  be  viewed  as  a  continuum,  with  feedback  leading  to  refine¬ 
ment  and  improvement  of  the  methods  used  for  all  four  components. 

•  The  scientific  basis  for  quantifying  the  ecological  importance  of  observed  or 
predicted  changes  in  assessment  endpoints  is  poorly  developed. 

•  Monitoring  can  be  useful  in  the  field  validation  of  ERA. 
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Perspectives  on  sediment  ecological 
risk  assessment  for  hazardous  waste  sites 

Charles  A.  Menzie 


6*1  Introduction 

This  chapter  provides  some  perspectives  concerning  the  application  of  ERA  to  sediment 
contamination  at  hazardous  waste  sites.  It  is  not  a  comprehensive  review  but  rather  is 
intended  to  foster  discussion.  In  some  places,  reference  is  made  to  other  chapters  where 
additional  detail  can  be  found.  Human  health  risk  issues  are  not  addressed,  although 
both  direct  contact  and  transfer  of  chemicals  to  food  items  are  recognized  pathways  of 
exposure  to  humans  from  sediments  at  hazardous  waste  sites. 

Sediment  risk  analyses  for  hazardous  waste  sites  can  be  both  retrospective  and  predictive 
in  nature,  and  they  should  focus  on  specific  questions.  Examples  of  common  questions 
that  sediment  risk  analyses  at  hazardous  waste  sites  serve  to  address  include  these: 

1)  Are  chemical  levels  in  sediments  toxic  to  species  living  or  foraging  in  stream  or 
wetland  sediments? 

2)  Are  chemicals  present  in  sediments  at  levels  that  may  affect  the  abundance  or 
species  composition  of  benthic  invertebrates? 

3)  Are  chemicals  in  sediments  being  bioaccumulated  and  transferred  via  food 
webs  to  fish  and  wildlife  species  at  levels  that  may  affect  the  survival  or  repro¬ 
ductive  success  of  these  species? 

4)  What  is  the  likelihood  that  chemicals  in  sediments  may  be  released  to  the  water 
column  as  a  result  of  future  hydrological  events?  Could  the  resulting  exposure 
levels  cause  unacceptable  environmental  effects? 

5)  What  is  the  likelihood  that  chemicals  will  be  buried  and  removed  as  sources  of 
exposure? 

6)  What  are  the  relative  risk  reductions  associated  with  alternative  remedial 
options? 

7)  What  are  the  risks  associated  with  implementing  a  specific  remedial  action? 

6.2  Historical  perspective 

Sediment  risk  assessment  at  hazardous  waste  sites  has  a  history  rooted  in  three  general 
areas:  1)  the  development  of  chemical  and  biological  tools  for  assessing  sediment  condi¬ 
tions,  2)  the  conduct  of  a  number  of  high-profile  site-specific  assessments  in  particular 
areas  of  the  country,  and  3)  the  emergence  of  ERA  as  an  organizing  framework. 
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Over  the  past  few  decades,  numerous  tools  have  been  developed  and  applied  to  gain  a 
better  understanding  of  sediment  conditions.  To  a  large  extent,  these  tools  are  designed 
to  determine  the  status  of  the  sediments  either  in  terms  of  toxicity,  geochemistry,  physi¬ 
cal  conditions,  structure  of  the  benthic  invertebrate  community,  habitat  suitability,  or 
health  of  fish  populations  using  the  area.  Recent  tools  include  methods  for  measuring 
bioavailable  fractions  of  organic  chemicals  and  metals.  With  this  expanding  tool  kit,  sci¬ 
entists  are  able  to  examine  the  general  status  of  sediment  conditions  and  examine  trends 
either  spatially  or  temporally.  The  U.S.  National  Oceanic  Atmospheric  Administration  s 
(NOAA  s)  National  Status  and  Trends  Program  (NS&T)  and  the  USEPAs  Environmental 
Monitoring  and  Assessment  Program  (EMAP)  are  examples  of  large-scale  efforts  that  use 
a  battery  of  tools  to  assess  conditions  (NOAA  1991;  Reifsteck  et  al  1992).  While  these 
programs  —  and  many  on  smaller  scales  —  are  useful  for  examining  conditions  and  for¬ 
mulating  questions  concerning  risks  associated  with  various  stressors,  the  programs 
themselves  are  not  examples  of  risk  analyses. 

A  number  of  high-profile  sediment  contamination  problems  in  the  U.S.  and  elsewhere 
led  to  the  development  of  area  or  regional  approaches  for  evaluating  risks  associated  with 
contaminated  sediments  during  the  1970s  and  1980s,  prior  to  publication  of  national 
HHRA  or  ERA  paradigms  or  frameworks.  Most  of  the  high-profile  cases  involved  retro¬ 
spective  assessments  associated  with  historical  releases  of  contaminants  such  as  PCBs  in 
the  Hudson  River  and  New  Bedford  Harbor,  Kepone  in  the  James  River,  and  numerous 
organic  and  inorganic  contaminants  in  embayments  of  Puget  Sound.  These  and  other 
high-profile  cases  demanded  that  decisions  be  made  concerning  the  need  for  remediation 
and  the  nature  and  extent  of  remediation.  The  analyses  that  were  undertaken  to  inform 
those  decisions  brought  together  many  of  the  tools  mentioned  above  into  frameworks 
that  could  be  used  to  “get  a  handle”  on  problems  and  determine  the  need  for  action. 
During  the  1970s  and  1980s,  there  were  also  a  number  of  high-profile  predictive  assess¬ 
ments  including  ocean  dumping  and  offshore  oil  and  gas  exploration  (International 
Ocean  Disposal  Symposium  [lODS]  Special  Symposium  1983;  Duedall  etal  1985;  Nocito 
et  al  1989).  These  retrospective  and  predictive  programs  had  elements  of  risk  analyses, 
although  they  were  not  explicitly  stated  as  such. 

In  the  late  1980s  and  early  1990s,  hazardous  waste  site  programs  in  the  U.S.,  Canada,  and 
elsewhere  identified  risk  assessment  as  a  critical  tool  for  evaluating  the  need  for  and  effi¬ 
cacy  of  cleanup  options.  Most  attention  was  initially  given  to  assessing  human  health 
risks  and  formal  guidance  was  developed  in  the  mid-1980s  to  support  the  U.S.  Superfund 
Program.  Ecological  risk  assessments  have  now  become  a  key  component  of  the  remedial 
investigation  and  feasibility  studies  (RI/FS)  at  Superfund  sites  (USEPA 1994),  state  sites, 
and  sites  in  Canadian  provinces  (Gaudet  et  al.  1995).  While  risk  analyses  certainly  pre¬ 
ceded  the  development  of  frameworks,  guidance,  and  terminology,  the  process  is  now 
more  explicitly  defined  so  that  it  can  be  discussed  in  scientific  and  policy  forums. 

One  of  the  major  benefits  of  risk  assessment  is  that  the  process  provides  an  organizing 
framework  for  approaching  problems  wherein  tools  and  experience  can  be  brought  to¬ 
gether  in  an  appropriate  way  to  address  the  questions  related  to  the  decisions  at  hand. 
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One  of  the  most  important  aspects  of  the  process  is  formulating  the  questions  the  assess¬ 
ments  are  intended  to  address.  Questions  set  the  direction  for  the  technical  aspects  of 
both  human  health  and  ecological  risk  analyses.  An  assessment  that  does  not  begin  with 
a  clear  question  is  like  a  ship  without  a  rudder.  No  matter  how  sophisticated  the  equip¬ 
ment  aboard,  the  ship  will  probably  never  reach  a  satisfactory  destination. 

6.3  Guidance 

Ecological  risk  assessment  guidance  is  being  developed  by  the  USEPA  for  Superfund  sites, 
by  Environment  Canada,  and  by  many  states  and  provinces.  Some  states  —  Wisconsin, 
for  example  —  have  developed  guidance  specific  for  sediments.  For  the  most  part,  guid¬ 
ance  is  based  on  the  USEPAs  Framework  for  Ecological  Risk  Assessment  (USEPA  1992), 
often  using  the  guidance  as  a  starting  point  (Menzie-Cura  &  Associates  1996). 

The  USEPAs  Superfund  guidance  illustrates  the  steps  that  may  be  involved  in  assessing 
risks  associated  with  contaminated  sediments  at  hazardous  waste  sites  (Table  6-1).  The 
approach  incorporates  the  concepts  of  ERA  articulated  in  the  USEPAs  framework  into  a 
procedure. 

As  indicated  in  Table  6-1,  a  stepwise  approach  is  suggested,  beginning  with  preliminary 
problem  formulation.  Guidance  regarding  phased  or  tiered  approaches  can  be  found  in 
Environment  Canadas  guidance  (Canadian  Council  of  the  Ministers  of  the  Environment 
[CCME]  1995)  and  in  guidelines  developed  by  a  number  of  states.  For  example,  the  Mas¬ 
sachusetts  Department  of  Environmental  Protection  (MA  DEP)  Ecological  Risk  Assess¬ 
ment  Guidance  identifies  two  stages,  Stage  I:  Environmental  screening  and  Stage  II;  Risk 
characterization  (quantitative  process),  and  has  developed  specific  guidance  on  how  each 
of  these  phases  would  be  implemented.  Phased  or  tiered  approaches  permit  the  assess¬ 
ment  to  fit  the  size,  scope,  and  implication  of  the  decisions.  A  possible  constraint  is  the 
schedule  imposed  upon  the  investigation  process;  a  set  clock  for  completion  of  a  study 
may  not  be  compatible  with  a  phased  study  unless  the  phases  are  identified  at  the  onset. 

6.4  Sediment  risk  assessment  tools  for  hazardous  waste  sites 

A  broad  range  of  tools  has  been  developed  to  evaluate  physical,  chemical,  or  biological 
aspects  of  sediments.  As  already  noted,  these  tools  may  be  applied  for  various  purposes 
including  the  analysis  of  risks  at  hazardous  waste  sites.  To  make  effective  use  of  tools  in 
a  sediment  risk  assessment  for  a  hazardous  waste  site,  the  questions  and  potential  deci¬ 
sions  should  be  understood  at  the  beginning  of  the  process.  The  amount  of  information 
needed  to  answer  the  questions  or  reach  a  decision  may  vary  depending  on  the  scope  of 
the  problem  and  the  certainty  to  which  an  answer  is  desired.  Thus,  risk  analyses  and 
tools  may  be  employed  in  a  tiered  fashion,  leading  from  simpler  (generally  conservative) 
to  more  sophisticated  approaches. 

A  few  of  the  common  biological  assessment  tools  that  have  been  used  for  risk  assessment 
of  contaminated  sediments  are  described  in  Table  6-2  with  reference  to  their  use  in  risk 
assessment  (see  Chapter  18  for  additional  detail). 
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Table  6-1  Steps  in  ecological  risk  assessment  for  Superfiind  sites  from  draft  guidance 
Step  Description 


Preliminary  problem  formulation,  ecological  effects  evaluation 

Preliminary  exposure  estimate  and  risk  calculation 

Problem  formulation:  assessment  endpoint  selection 

Problem  formulation:  conceptual  model,  measurement  endpoint  selection, 
study  design 


5  Site  assessment  for  sampling  feasibility 

6  Site  investigation 

7  Risk  characterization  (integration  of  exposure  and  effects) 

8  Risk  management 
Source:  USEPA 1994. 


In  addition  to  the  common  biological  assessment  tools  in  Table  6-2,  several  other  catego¬ 
ries  of  tools  are  worth  noting: 

1)  Sediment  fate  and  transport  studies  (laboratory  and  field)  including  deposition, 
burial,  and  scouring  studies.  Information  is  used  to  evaluate  exposures  (Chapter 

2)  Equilibrium  and  other  partitioning  methods  developed  to  determine  the 
bioavailable  fractions  of  chemicals  in  sediments.  Such  information  is  commonly 
used  in  exposure  assessments  (Chapter  18). 

3)  Models  of  fate,  transport,  and  bioaccumulation  (Chapter  17) 

6.5  Psychological  ^^dilemmas'^ 

There  are  a  number  of  psychological  dilemmas  that  risk  assessors  encounter  during  the 
scoping,  analysis,  and  characterization  phases  of  sediment  risk  assessments.  In  some 
cases,  these  can  limit  the  information  content  of  the  assessment  or  yield  results  in  a  form 
that  does  not  capture  the  information  adequately.  For  discussion  purposes,  I  have  iden¬ 
tified  these  dilemmas  as  1)  safety  in  numbers,  2)  the  testing  hot  potato,  3)  the  comfort 
zone,  and  4)  so  what  does  it  mean? 

6.5.1  Safely  in  numbers 

Numerical  criteria  or  guidelines  —  derived  either  on  a  generic  or  site-specific  basis  — 
appear  to  be  highly  desirable  to  risk  managers  seeking  a  bright  line  against  which  to  com¬ 
pare  a  chemical  measurement.  Research  on  risk  perception  and  communication  has 
shown  that  people  relate  most  easily  to  the  question,  “Is  it  safe?,”  if  clear  benchmarks  are 
provided.  This  desire  led  to  the  development  of  ambient  W(^C  and  is  the  basis  for  ongo¬ 
ing  efforts  to  develop  SQC.  These  are  described  further  in  Chapter  18. 
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While  sediment  criteria  or  guidelines  (/>.,  numbers)  are  desirable  to  risk  managers  and 
can  be  useful  for  screening  sites  (especially  where  conditions  are  simple),  they  should  be 
used  with  caution  in  sediment  risk  assessment.  The  criteria  or  guideline  may  not  relate  to 
the  question  that  is  being  asked  in  the  risk  assessment,  and  thus  there  may  be  a  mismatch 
between  the  question  and  assessment  tool  (the  criterion).  Because  experience  has  shown 
that  sediment  contamination  problems  can  be  quite  complex  and  variable  from  one  place 
to  the  next,  criteria  and  guidelines  can  be  overprotective  or  underprotective  depending 
on  circumstances.  Equilibrium  partitioning  methods  apply  to  individual  chemicals  in 
simple  systems  and  may  not  be  appropriate  for  evaluating  complex  mixtures,  especially 
where  aging  has  occurred.  The  challenge  for  the  risk  assessor  and  risk  manager  is  in  de¬ 
veloping  an  understanding  of  the  strengths  and  limitations  of  chemical-specific  numbers 
within  the  context  of  the  risk  assessment  and  risk  management  decisions.  Further,  where 
numerical  values  fall  short,  the  risk  assessor  and  risk  manager  need  to  identify  appropri¬ 
ate  risk  communication  tools  for  use  in  management  decisions  (Chapter  18). 

6.5.2  The  testing  hot  potato 

This  might  best  be  described  as  “fear  of  the  unknown.”  This  fear  is  usually  expressed  by 
a  potentially  responsible  party  (PRP)  or  their  attorneys  during  planning  stages  of  an  as¬ 
sessment.  While  chemical  measurements  in  media  are  familiar  to  PRPs,  ecological  test- 
especially  toxicity  testing  and  tissue  analyses  —  can  be  unfamiliar  territory.  As  a 
result,  there  are  concerns  about  what  such  testing  will  yield  as  well  as  the  relevance  of  the 
testing  for  the  decision.  To  some  degree,  these  fears  are  well  founded  in  that  there  does 
not  exist  a  clear  framework  for  incorporating  such  information  into  assessment  of  risks, 
and  our  understanding  about  how  to  relate  measurements  to  assessment  endpoints  is 
still  evolving.  A  result  of  these  concerns  may  be  debate  about  the  scopes  of  the  assess¬ 
ments  when  these  are  cast  simply  as  measurement  programs. 

6.5.3  The  comfort  zone 

Many  scientists  and  managers  who  participate  in  risk  assessments  bring  with  them  a 
knowledge  and  experience  base  that  forms  their  personal  “comfort  zone”  regarding  what 
is  important  and  what  works  when  assessing  risks  associated  with  contaminated  sedi¬ 
ments.  As  a  result,  there  is  often  a  desire  to  apply  familiar  tools  or  approaches  from  past 
experience  to  new  situations.  These  may  or  may  not  be  the  best  tools  or  approaches  to 
use.  Because  people  do  not  share  common  values  or  a  common  knowledge  or  experience 
base,  conflicts  often  arise  concerning  what  is  important  and  how  the  assessment  should 
be  conducted.  These  can  be  best  handled  in  the  problem  formulation  and  planning 
stages  of  the  project,  during  which  participants  —  especially  the  risk  assessors  and  risk 
managers  —  should  be  open  minded. 

6.5.4  So  what  does  it  mean? 

Communication  between  the  stakeholders,  risk  assessor,  and  risk  manager  are  vitally 
important  during  the  planning  and  implementation  of  the  assessment  in  order  to  avoid 
a  huge  SO  WHAT?  question  at  the  end  of  the  analyses.  A  common  misconception  is  that 
the  ecologist  or  environmental  scientist  (risk  assessor)  is  the  one  who  knows  what  is  im- 
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portant  to  address  in  an  ERA  and  should,  therefore,  formulate  the  questions.  Where  this 
occurs,  risk  assessors  assume  the  roles  of  risk  assessor,  risk  manager,  and  stakeholder, 
bringing  to  the  problem  formulation  their  own  personal  set  of  values.  “What  is  impor¬ 
tant?”  is  not  solely  an  ecological  question.  It  is  also  a  question  of  “What  is  important  to 
people?”  If  the  assessment  is  disconnected  from  what  stakeholders  and  risk  managers 
consider  important  or  if  the  importance  of  the  assessment  cannot  be  established  in  a 
meaningful  way,  then  the  analysis  is  likely  to  fail  in  providing  information  useful  for 
decision-making. 

6.6  Recommendations  for  sediment  risk  assessment  at 
hazardous  waste  sites 

There  are  two  recommendations  for  future  risk  assessment  of  sediments  that  I  consider 
especially  important:  1)  develop  guidance  for  the  problem  formulation  phase  of  sedi¬ 
ment  risk  analyses  and  2)  utilize  a  weight-of-evidence  approach  within  a  tiered  or  phased 
strategy. 

6.6.1  Guidance  for  problem  formulation 

This  is  the  most  critical  stage  of  the  risk  assessment.  Based  on  a  review  of  many  case  stud¬ 
ies,  it  is  clear  that  the  strengths  and  limitations  of  ERAs  stem  from  how  well  the  problem 
was  formulated  from  the  onset.  The  most  important  thing  to  occur  in  problem  formula¬ 
tion  is  defining  the  questions  the  assessment  is  intended  to  address.  This  process  involves 
communication  among  the  stakeholders,  risk  manager,  and  risk  assessor.  These  form  the 
assessment  endpoints  to  be  evaluated  with  regard  to  sediments.  At  present,  there  is  very 
little  guidance  on  this  process  but  its  importance  can  be  underscored  by  the  adage,  “It  is 
more  important  to  do  the  right  thing,  than  to  do  the  thing  right.”  The  balance  of  the  as¬ 
sessment  —  how  to  do  it  right  —  begins  with  asking  the  right  questions. 

6.6.2  Utilize  weight-of-evidence  approach  within  tiered  or  phased 
strategy 

Most  risk  analyses  for  sediments  at  hazardous  waste  sites  will  involve  a  weight-of-evidence 
approach.  However,  there  is  no  consensus  on  exactly  what  this  means  or  how  it  should  be 
carried  out.  Published  definitions  include  the  following: 

Each  risk  estimate  will  have  its  own  assumptions  and  associated  uncertainties 
and  these  may  not  be  expressed  equivalently.  The  separate  lines  of  evidence 
must  be  evaluated,  organized  in  some  coherent  fashion,  and  explained  to  the 
risk  manager  so  that  a  weight-of-evidence  evaluation  can  be  made  (Suter  1993). 

Risk  description  has  two  primary  elements.  The  first  is  the  ecological  risk  sum¬ 
mary,  which  summarizes  the  results  of  the  risk  estimation  and  uncertainty 
analysis  and  assesses  confidence  in  the  risk  estimates  through  a  discussion  of 
the  weight  of  evidence  (USEPA 1992). 

For  many  Superfund  ecological  risk  assessments,  a  weight-of-evidence  approach 
will  be  used.  This  frequently  will  require  that  different  types  of  data  are  evalu- 
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ated  together.  These  types  of  data  may  include  toxicity  test  results,  assessments 
of  existing  impacts  on-site,  or  true  risk  calculations  comparing  estimated  expo¬ 
sure  doses  with  toxicity  values  from  the  literature  Balancing  and  interpreting  the 
different  types  of  data  can  be  a  major  task... the  strength  of  evidence  provided  by 
different  types  of  tests  and  the  precedence  that  one  type  of  study  has  over  an¬ 
other  should  already  have  been  determined....  This  will  insure  that  data  inter¬ 
pretation  is  objective  and  not  designed  (i.e.,  biased)  to  support  a  preconceived 
answer  (USEPA 1994). 

A  multi-agency/consultant  workgroup  established  in  Massachusetts  to  examine  the 
question  of  how  to  conduct  a  weight-of-evidence  approach  defined  the  process  as  follows: 

The  weight-of-evidence  approach  is  the  process  by  which  measurement 
endpoint(s)  are  related  to  an  assessment  endpoint  to  evaluate  if  there  is  a  signifi¬ 
cant  risk  of  harm  to  the  environment.  The  approach  is  planned  and  initiated  at 
the  Problem  Formulation  Stage  and  results  are  integrated  at  the  Risk  Character¬ 
ization  Stage  (MA  DEP  Workgroup  1995;  Menzie  etal  1996). 

This  definition  provides  an  explicit  link  between  risk  characterization  and  the  questions 
developed  —  assessment  endpoints  —  during  the  problem  formulation  phase.  The  Mas¬ 
sachusetts  workgroup  has  developed  a  qualitative  and  quantitative  procedure  based  on 
the  following  considerations: 

1)  Degree  of  confidence  or  weight  placed  in  each  measurement  endpoint 

2)  Response  of  the  measurement  endpoint 

3)  Extent  of  agreement  or  divergence  among  measurement  endpoints  judged  with 
respect  to  the  initial  weight  or  confidence 

The  Massachusetts  workgroup  has  applied  the  approach  to  case  studies  involving  sedi¬ 
ment  contamination  and  found  that  it  works  well  as  a  planning  and  communication  tool. 
It  has  the  advantage  of  making  transparent  the  risk  assessors’  understanding  of  the 
strengths  and  limitations  of  specific  lines  of  evidence  as  they  relate  to  a  particular  assess¬ 
ment  endpoint.  The  workgroup  has  reached  general  agreement  on  the  attributes  to  con¬ 
sider  in  developing  a  level  of  confidence  or  weight  associated  with  each  measurement 
endpoint  (Table  6-3).  In  the  weight-of-evidence  procedure,  ten  attributes  of  each  mea¬ 
surement  endpoint  are  evaluated.  For  a  given  assessment  endpoint,  the  quality  of  each 
measurement  endpoint  is  compared  with  respect  to  these  attributes;  those  measurement 
endpoints  with  the  highest  quality  for  the  most  attributes  are  given  the  greatest  weight  in 
the  overall  characterization  of  risk. 
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Table  6-3  Attributes  for  judging  measurement  endpoints 


I.  Attributes  related  to  strength  of 
association  between  assessment 
and  measurement  endpoints 

II.  Attributes  related  to  data 
quality 

III.  Attributes  related  to  study 
design  and  execution  in  relation 
to  the  assessment  endpoint 

•  Biological  linkage  between 
measurement  endpoint  and 
assessment  endpoint 

•  Extent  to  which  dka 
quality  objectives  are  met 

•  Site  specificity 

•  Utility  of  measure  forjudging 
environmental  harm 

•  Sensitivity  of  the  measurement 
endpoint  for  detecting  changes 

•  Correlation  of  stressor  to 
response 

•  Spatial  representativeness 

•  Temporal  representativeness 

•  Quantitativeness 

•  Use  of  a  standard  method 

Source:  MADEP 1995. 
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7.1  Introduction 

Risk  assessments  are  an  integral  part  of  the  RI/FS  conducted  as  part  of  any  waste- 
contaminated  site  cleanup  effort.  Ecological  risk  assessments  provide  a  basis  for  evaluat¬ 
ing  existing  risks  {i.e.,  baseline  conditions)  and  provide  a  framework  for  evaluating  man¬ 
agement  alternatives,  including  the  potential  effects  associated  with  remedial  alternatives 
under  consideration  for  the  site.  As  described  in  Chapter  1,  several  documents  have  pre¬ 
viously  outlined  and  offered  guidance  for  conducting  ERAs  {e.g.,  USEPA  1992).  Addi¬ 
tional  forthcoming  guidance  includes  the  draft  USEPA  document,  Ecological  Risk 
Assessment  Guidance  for  Superfund:  Process  for  Designing  and  Conducting  Ecological 
Risk  Assessments  (USEPA  1994a)  and  the  Canadian  approach  (Gaudet  et  al  1995).  How¬ 
ever,  existing  or  forthcoming  guidance  documents  do  not  specifically  address  the  issue  of 
assessing  the  risks  associated  with  sediments  at  contaminated  sites. 

The  four  basic  steps  involved  in  conducting  any  ERA  are  applicable  to  contaminated  site 
cleanups:  1)  problem  formulation  and  site  characterization,  2)  exposure  assessment,  3) 
effects  assessment,  and  4)  risk  characterization.  Each  risk  assessment  varies  as  to  the  level 
of  detail  required  in  the  above  steps.  The  need  for  detailed  information  is  determined  by 
regulatory  site  status,  the  goals  of  the  remedial  actions,  the  complexity  of  the  site,  the 
potential  magnitude  of  remedial  options,  and  the  difficulty  in  adequately  describing  ex¬ 
posure,  toxicity,  and  the  properties  of  potential  concern.  In  all  cases,  SERA  should  make 
optimal  use  of  all  available  data.  Ideally,  additional  empirical  data  should  be  collected 
only  to  fill  information  needs,  in  particular  substituting  site-specific  information  for  as¬ 
sumptions,  and  to  reduce  uncertainties  in  critical  factors  that  strongly  influence  risk  es¬ 
timates. 

An  SERA  should  be  conducted  in  a  tiered  or  phased  approach  (USEPA  1992).  Sediment 
ecological  risk  assessments  may  be  conducted  as  part  of  an  overall  site  risk  assessment 
and  may  need  to  be  integrated  with  the  overall  ERA.  An  initial  analysis  (Tier  1)  is  con¬ 
ducted,  using  very  conservative  assumptions  about  exposure  and  toxicity,  then  a  decision 
is  made  whether  and  how  to  proceed.  Typically,  this  initial  screening-level  risk  assess¬ 
ment  is  conducted  using  available  data  and  conservative  assumptions  about  exposure 
and  toxicity  or  using  existing  effects-based  benchmark  values  or  guidelines  {e.g.,  Smith  et 
al.  in  pressX  On  the  basis  of  this  screening-level  assessment,  conclusions  may  be  reached 
suggesting  that  ecological  risks  which  require  management  or  reduction  do  not  exist  or 
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are  insignificant.  Alternatively,  when  the  screening-level  SERA  indicates  that  there  is  a 
potential  risk,  the  subsequent  risk  assessment  may  be  used  to  prioritize  areas,  chemicals, 
and  species  at  the  site  and  to  decide  what  kinds  of  additional  data  are  necessary  to  de¬ 
velop  a  more  definitive  risk  estimate  (e.g.,  Tiers  1  and  2).  In  later  tiers,  additional  data 
may  be  collected  that  will  better  define  the  exposure-response  relationship  at  the  site.  At 
each  tier,  a  pass/fail  decision  is  made  whether  to  proceed  to  advanced  tiers  and,  if  so,  how 
best  to  proceed.  The  criteria  for  tier  advancement  or  for  reaching  a  conclusion  of  no  sig¬ 
nificant  risk  should  be  specified  to  the  extent  possible  during  risk  assessment  planning. 
Data  needs  for  subsequent  tiers  are  refined  after  the  results  of  preceding  tiers  are  ana¬ 
lyzed. 

A  number  of  issues  differentiate  SERAs  conducted  for  waste- contaminated  sites  from 
those  conducted  for  other  reasons  and  detailed  in  other  chapters  evaluation  of  new 
products  in  Chapters  6  and  7,  or  evaluation  of  sediments  for  dredge  spoils  relocation  in 
Chapter  5).  These  issues  include  the  following: 

have  occurred.  However,  the  evaluation  of  cleanup  or  remediation  alternatives 
requires  predictive  or  prospective  evaluation  techniques. 

•  Assessments  for  waste-contaminated  sites  are  interactive  in  nature,  requiring 
ongoing  interaction  and  communication  between  the  risk  assessor  and  the  risk 
manager.  This  requires  that  the  risk  manager  be  an  integral  part  of  the  assess¬ 
ment  process  from  the  outset. 

•  Because  assessments  at  waste-contaminated  sites  require  interaction  with  a 
variety  of  disciplines  and  groups  other  than  environmental  toxicologists  and 
chemists,  including  but  not  restricted  to  remediation  engineers,  lawyers, 
hydrogeologists,  and  the  public,  it  is  imperative  that  the  approach  used  and 
interpretation  of  results  and  conclusions  be  easily  understandable  and  thor¬ 
oughly  presented. 

This  chapter  provides  guidance  on  conducting  SERAs  for  waste-contaminated  sites,  in¬ 
cluding  refinement  or  modification  of  generic  ERA  procedures  (Chapter  1).  This  guid¬ 
ance  is  not  intended  to  be  inflexible;  to  the  contrary,  flexibility  is  encouraged  given  the 
site-  and  situation-specificity  of  contaminated  sites.  However,  we  caution  investigators  to 
remember  that  sediment  evaluations  (or  other  investigations)  typically  focus  on  only  a 
snapshot  in  time,  and  both  temporal  and  spatial  shifts  must  be  considered  as  they  may 
affect  redistribution  of  contaminants  or  result  in  modifications  to  biological  communi¬ 
ties  which  may  or  may  not  be  related  to  contaminants  in  sediments. 

Further,  note  that  the  current  state-of-the-art  for  evaluating  sediment  effects  is  in  its  in¬ 
fancy.  This  is  particularly  true  for  mechanistic  understanding  of  sediment  toxicity.  Thus, 
presently,  the  only  possible  approach  is  primarily  empirical.  With  the  development  of  the 
science  and  of  our  knowledge  of  mechanistic  relationships,  this  will  no  doubt  change,  but 
an  empirical  approach  will  probably  always  play  a  major  role  in  SERA. 


Assessments  for  waste-contaminated  sites  are  typically  retrospective,  being 
conducted  after  wastes  have  been  deposited  and  often  after  ecological  effects 
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7.2.1  Initial  problem/issue  definition 

Background  information  and  preliminary  sampling  data  are  used  to  identify  the  problem 
and  define  the  issues  that  need  to  be  addressed  at  a  contaminated  site.  At  this  stage  of  the 
sediment  assessment,  a  number  of  aspects  should  be  considered  in  formulating  the  prob¬ 
lem  and  developing  the  conceptual  model  for  the  site.  These  include  source  definition, 
consideration  of  the  potential  effects  of  the  identified  contaminants,  and  initial  identifi¬ 
cation  and  characterization  of  potential  receptors. 

7.2.1. 1  Source  definition 

Source  definition  at  contaminated  sites  is  important  because 

•  from  a  legal  perspective,  it  establishes  linkages  to  potential  contributors  (i.e.,  to 
the  problem)  who  might  participate  in  the  investigation  or  remediation  of  the 
site;  and 

•  from  a  scientific  perspective,  it  identifies  the  form  of  discharge  and  hence  the 
fate  and  transport  processes  linking  land-based  contamination  to  aquatic 
systems,  including  sediments  at  one  location  with  other  areas  where  exposure 
could  occur.  It  is  therefore  the  first  step  in  the  conceptual  model.  Note  that  areal 
and  temporal  dimensions  are  both  important. 

In  addition,  source  identification  may  help  to  focus  the  analyses  for  contaminants  or 
limit  the  number  of  contaminants  of  concern  if  historical  information  about  the  site  is 
available. 

The  identification  of  sources  of  sediment  contamination  generally  involves  an  examina¬ 
tion  of  the  operational  histories  of  private  or  public  facilities  that  may  have  contributed 
directly  to  contaminated  sediments  or  to  land-based  contamination  that  has  or  might 
migrate  to  the  sediments.  The  process  can  become  complicated  by  the  presence  of  many 
different  historical  or  current  sources.  In  such  cases,  the  site  investigators  and  managers 
must  decide  how  to  define  sources.  One  option  is  to  define  sources  in  terms  of  a  sediment 
contamination  problem  within  the  aquatic  system.  In  such  cases,  a  number  of  sources 
may  be  identified.  Another  option  is  to  differentiate  a  particular  source  from  ubiquitous 
or  upstream  contamination  associated  with  other  point  and  nonpoint  sources  (i.e.,  back¬ 
ground). 

An  additional  set  of  considerations  involves  fate  and  transport  processes  that  may  link 
land-based  contamination  or  in-place  sediments  with  other  areas  where  exposure  may 
occur.  The  following  processes  are  important  to  consider  at  this  stage: 

•  Surface  erosion  and  bulk  transport  of  contaminants  present  in  surface  soils  at  a 
site 

•  Seep  discharge  of  non-aqueous  phase  liquids  (NAPL)  present  at  a  site  or  along 
the  bank  of  a  water  body  which  may  be  denser  than  water  (D-NAPL)  or  lighter 
than  water  (L-NAPL),  which  could  enter  the  aquatic  environment  by  migrating 
along  geological  features  or  on  ground  water  and  contaminate  sediments 
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•  Contaminated  groundwater  from  a  site  which  could  discharge  to  an  adjacent 
water  body  and  result  in  exposure  to  benthic  organisms  or  result  in  contamina¬ 
tion  of  sediments 

•  Resuspension  of  contaminated  sediments  at  or  below  the  sediment-water 
interface  (sometimes  by  infrequent  storm  or  flood  events),  such  that  they 
become  exposed  to  the  surface  or  are  transported  to  other  areas 

Each  of  these  considerations  can  affect  how  the  problem  and  sources  are  defined  at  a 
contaminated  site  and  how  the  risk  assessment  is  implemented.  Each  may  lead  to  differ¬ 
ent  remedial  strategies. 

7.2. 7.2  Potential  effects  of  contaminants 

Contaminants  vary  in  their  environmental  behavior  and  effects.  These  characteristics  will 
affect  how  an  SERA  is  structured.  At  the  problem  formulation  stage,  it  is  important  to 
consider  characteristics  of  individual  chemicals  or  classes  of  chemicals  such  as  persis¬ 
tence,  hydrophobicity,  potential  for  food  chain  transfer,  and  toxic  effects  on  receptors. 
Most  chemicals  that  tend  to  sorb  to  sediments  can  be  placed  into  one  of  the  general  cat¬ 
egories  shown  in  Table  7-1. 

7.2. 7.3  Identification  and  characterization  of  potential  receptors 

Identification  of  potential  receptors  depends,  in  part,  on  the  habitat  and  contaminant. 
An  SERA  should  consider  all  receptors  potentially  exposed  to  sediment  contaminants 
either  directly  or  indirectly.  For  sediment  assessments,  the  benthic  invertebrate  commu¬ 
nity  is  often  identified  as  a  receptor  along  with  demersal  fish  species.  If  chemicals  (e.g., 
chlorinated  pesticides,  PCBs,  certain  mercury  compounds)  have  a  potential  for  transfer 
from  sediments  via  food  webs,  then  pelagic  fish  species  (e.g.,  lake  trout)  and  other  wild¬ 
life  (diving  birds  and  fish-eating  mammals)  may  be  considered  in  the  risk  assessment. 
Typically,  wading  birds  and  mammals  that  feed  on  benthic  invertebrates  or  may  be  ex¬ 
posed  directly  to  sediments  are  considered  for  shallow  water  sediments.  Plants  may  also 
be  considered  as  potential  receptors. 

The  process  of  characterizing  receptors  usually  involves  a  combination  of  reconnaissance 
survey  work  (site  visits)  together  with  natural  history  information.  Depending  on  the 
needs  of  the  risk  assessment,  additional  data  on  receptors  may  be  gathered  at  later  stages 
of  the  evaluation. 

7.2. 7.4  Summary 

The  product  of  the  initial  problem/issue  definition  should  be  a  clear  statement  of  the 
nature  of  the  sources  at  the  site,  potential  issues  concerning  migration,  potential  con¬ 
taminants  of  concern  at  the  site,  potential  effects  associated  with  these  chemicals,  and 
receptors  that  should  be  considered.  This  information  may  be  focused  further  as  the  as¬ 
sessment  proceeds.  Alternatively,  additional  sources,  contaminants,  or  receptors  may  be 
added  as  additional  information  is  gathered.  However,  the  primary  objective  should  be 
to  gather  and  organize  information  in  a  manner  that  can  lead  to  the  development  of 
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Table  7-1  General  categories  of  chemicals  sorbing  to  sediments  (matrix  would  be  filled  in  for  specific 
studies) 


Bioaccumulation 

EiFects 

Limited 

bioaccumulation  in 
invertebrates 

Bioaccumulates  in 
invertebrates  but 
limited  food  chain 
transfer 

Bioaccumulates  and  can 
be  transferred  via  food 
chains 

Lethal  and  sublethal 
effects  upon  direct 
contact 

[Relatively  most  serious] 

Lethal  and  sublethal 
effects  when 
bioaccumulated 

Sublethal  effects  when 
bioaccumulated 

clearly  stated  assessment  endpoints.  All  parties  involved  should  be  in  agreement  with  the 
clearly  stated  problem  definition. 

7.2.2  Conceptual  model  development 

Development  of  the  conceptual  model  as  part  of  the  problem  formulation  process  is  an 
iterative  process.  Initially,  one  conceptualizes  how  the  contaminants  may  have  moved 
both  physically  and  biologically,  their  exposure  pathways,  receptors  that  might  be  ex¬ 
posed,  and  mechanisms  for  toxic  effects  in  order  to  determine  the  primary  concerns  or 
priorities  for  what  needs  to  be  protected  {i.e.,  assessment  endpoints).  As  one  progresses 
through  problem  formulation,  the  conceptual  model  is  refined  to  establish  potential  ex¬ 
posure  pathways  and  receptors,  appropriate  measurement  endpoints,  and  a  study  design 
to  meet  data  needs.  The  finalized  conceptual  model  serves  as  input  to  the  analysis  phase 
of  the  assessment. 

The  outcome  of  the  problem  formulation  process  is  a  conceptual  model  describing  how 
a  stressor  might  affect  ecological  components  at  the  site.  The  conceptual  model  identifies 
the  exposure  pathways  and  specific  adverse  effects  (ecological  effects)  which  will  be  evalu¬ 
ated  in  the  SERA.  The  model  traces  contaminant  physical-chemical  fate  and  transport, 
including  food  chain  transmission,  and  defines  potential  adverse  effects.  It  allows  the  risk 
assessor  to  evaluate  the  exposure  pathway  to  potential  receptors,  particularly  related  to 
the  assessment  endpoints,  to  ensure  that  the  exposure  pathway  is  complete.  Further,  it 
describes  the  relationship  among  assessment  and  measurement  endpoints  and  is  used  to 
confirm  that  the  selected  measurement  endpoints  are  in  the  same  exposure  pathway  as 
the  assessment  endpoints. 
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The  conceptual  model’s  underlying  principle  is  that  it  is  representative  of  the  critical  ex¬ 
posure  pathways  and  the  ecotoxicity  threats  posed  to  specific  trophic  levels  or  other  eco¬ 
system  components.  Identification  of  these  exposure  pathways  is  analogous  to  food  chain 
energy  transfer  models.  For  particular  species  some  food  sources  are  of  greater  value  and 
some  food  items  are  particularly  high  in  energy  value.  These  same  concepts  are  true  for 
contaminant  transfers  in  ecosystems.  Sources  are  identified,  physical-chemical  transport 
mechanisms  are  identified,  and  exposures  to  receptors  are  evaluated  based  upon  the  rate 
of  utilization  (duration  of  exposure),  modes  of  toxicity,  and  contaminant  transfer  effi¬ 
ciency  (bioavailability). 

Thus,  the  site  conceptual  model  “represents”  the  exposure  pathways  and  receptors  (as¬ 
sessment  endpoints)  which  are  to  be  evaluated  in  the  SERA.  At  any  given  site,  the  appro¬ 
priate  assessment  endpoints  may  involve  individual  endangered  species,  local 
populations  of  particular  species,  community-level  integrity,  or  habitat  preservation.  The 
site  conceptual  model  must  encompass  the  level  of  biological  organization  appropriate 
for  the  assessment  endpoints  for  the  site.  It  may  or  may  not  be  completely  realistic  rela¬ 
tive  to  the  measurement  endpoints;  strict  “reality”  in  the  model  relative  to  the  measure¬ 
ment  endpoints  can  undermine  the  utility  of  the  model  for  evaluation  of  risk  to  the 
assessment  endpoint. 

Development  of  a  conceptual  model  also  identifies  data  requirements,  methodologies 
needed  to  analyze  the  data  (study  design),  and  points  or  assumptions  including  the  great¬ 
est  degree  of  uncertainty.  By  identifying  these  conservative  assumptions,  effort  can  be 
focused  in  the  study  design  to  address  information  gaps  or  sources  of  uncertainty, 
thereby  utilizing  site-specific  information  to  minimize  over-  or  underestimation  of  the 
actual  ecological  risks.  Typically,  this  translates  into  direct  field  evaluation  of  contami¬ 
nant  concentrations  at  exposure  points  and  of  bioavailability  or  toxicity  of  contaminants 
in  sediments. 

7.2.3  Selecting  and  defining  assessment  endpoints 

Assessment  endpoints  for  sediments  at  contaminated  sites  establish  the  overall  direction 
and  focus  of  the  risk  assessment.  Assessment  endpoints  are  established  through  discus¬ 
sions  among  the  risk  assessor,  risk  manager,  and  others  {e.g.,  the  public)  who  may  be  in¬ 
terested  in,  or  affected  by,  the  outcome  of  the  decisions.  In  some  cases  —  where  the 
problems  are  simple  or  well  understood  —  it  may  be  possible  to  proceed  based  on  past 
experience  without  extensive  discussions. 

Assessment  endpoints  reflect  what  human  beings  are  concerned  with  or  care  about,  ex¬ 
pressed  in  a  manner  that  can  be  evaluated  through  an  objective  scientific  process.  Such 
endpoints  generally  include  but  are  certainly  not  restricted  to  ecological  concerns.  As¬ 
sessment  endpoints  are  most  useful  when  expressed  in  terms  of  a  specific  receptor  (spe¬ 
cies,  habitat,  system)  and  a  specific  function  or  quality  that  is  to  be  maintained  or 
protected.  An  overly  broad  assessment  endpoint  (c.^.,  “health  of  the  environment”)  can 
be  difficult  to  evaluate.  In  such  cases,  the  assessor  may  find  it  helpful  to  disaggregate  the 
broad  assessment  endpoint  into  multiple  assessment  endpoints  that  consider  more  spe- 
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cific  characteristics  of  the  environment  or  species  under  consideration.  In  general,  this 
process  involves  professional  judgment  and  an  appreciation  of  what  is  needed  to  commu¬ 
nicate  the  analysis  effectively  to  the  risk  manager  and  to  other  interested  parties. 

At  contaminated  sites,  more  than  one  assessment  endpoint  is  typically  considered.  The 
exact  number  depends  on  the  chemicals  and  receptors  present. 

7.2.3. 7  Selecting  assessment  endpoints  for  sediments 

The  selection  of  assessment  endpoints  is  a  critical  communication  step  among  the  risk 
assessor,  risk  manager,  and  others  interested  in  the  outcome  of  the  assessment  (e.g,, 
stakeholders).  The  objective  is  to  reach  a  consensus  on  what  the  assessment  endpoints 
will  be.  The  site  manager,  with  assistance  from  the  risk  assessor/scientist,  is  typically 
responsible  for  resolving  issues  that  may  arise  among  various  involved  parties. 

The  process  of  selecting  assessment  endpoints  (once  the  endpoints  have  been  defined) 
usually  starts  with  the  risk  assessor/scientist  and  comprises  the  initial  site  conceptual 
model.  Based  on  a  general  knowledge  of  the  chemicals,  receptors,  and  site,  the  assessor/ 
scientist  usually  develops  an  initial  set  of  assessment  endpoints  for  consideration  by  the 
risk  manager  and  other  interested  parties.  For  simple  or  well-understood  sites,  this  typi¬ 
cally  involves  a  discussion  between  the  risk  assessor  and  site  manager. 

This  initial  set  of  endpoints  forms  the  basis  for  discussions  that  may  result  in  additions, 
deletions,  or  modifications  to  the  list  of  assessment  endpoints.  The  process  is  iterative. 
The  risk  assessor/scientist  often  plays  a  key  role  in  informing  other  parties  about  the 
ecological  or  human  health  “relevance”  of  various  assessment  endpoints,  and  helps  focus 
the  assessment.  Other  assessment  endpoints  or  modifications  to  endpoints  which  may  be 
suggested  by  the  site  manager  or  others  need  to  be  discussed  and  a  decision  made  by  the 
site  manager  (not  risk  assessor)  regarding  inclusion,  exclusion,  or  modification. 

7.2.3.2  Defining  assessment  endpoints 

Assessment  endpoints  should  be  defined  in  terms  of 

•  a  receptor  (species,  community,  other  level  of  organization)  and 

•  a  characteristic  or  function  (e.g.,  survival,  maintenance,  reproduction)  to  be 
protected. 

With  regard  to  contaminated  sediments,  there  are  several  categories  of  assessment  end¬ 
points  that  might  be  considered  depending  on  the  receptor  and  chemicals.  Examples  of 
assessment  endpoints  for  sites  with  contaminated  sediments  are  given  in  Table  7-2. 

7.2.4  Defining  and  selecting  measurement  endpoints 

Measurement  endpoints  are  components  of  assessment  endpoints  which  are  nonsubjec- 
tive  and  quantal  (i.e.,  measurable).  There  can  be,  and  often  are,  multiple  measurement 
endpoint  options  associated  with  an  assessment  endpoint.  Measurement  endpoints  are 
linked  to  assessment  endpoints  by  the  mechanism  of  toxicity  and  by  exposure  (e.g.,  repro¬ 
ductive  success  and  tissue  concentrations  of  PCBs). 
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Table  7-2  Example  assessment  endpoints  for  sediment  ecological  risk  assessment 


Receptors 

Assessment  endpoints 

Benthic  invertebrates 

•  community  structure  and  function  as  a  reflection  of  an 

aquatic  environment  typical  for  the  water  body 

•  survival  and  maintenance  of  selected  (keystone)  species 

•  support  offish  and  wildlife  species 

Fish 

•  community  structure  and  function  as  a  reflection  of  an 

aquatic  environment  typical  for  the  water  body 

•  maintenance  of  recreational  or  commercial  fisheries 

•  survival 

•  reproductive  success 

Birds 

Feeding  on  invertebrates 

Feeding  on  fish 

•  maintenance 

•  survival 

•  reproductive  success 

Mammals 

Feeding  on  invertebrates 

Feeding  on  fish 

•  maintenance 

•  survival 

•  reproductive  success 

Humans  (usually  a  separate  assessment) 
Ingesting  contaminated  fish  or 
shellfish 

Contacting  contaminated  sediments 

•  carcinogenic  risks 

•  non-carcinogenic  risks 

•  exceedance  of  public  health  values  (loss  of  recreational  or 

commercial  fishery) 

Vegetation 

Wetlands 

Submerged  aquatic  vegetation 

•  stressed  or  lost  vegetation 

•  survival  and  maintenance  of  selected  (keystone)  species 

•  alteration  of  community  structure  and  function 

•  support  offish  and  wildlife  species 

Threatened  or  endangered  species 

•  survival  of  individuals 
•reproductive  success  of  individuals 

7.2.4.  T  Defining  measurement  endpoints 

A  measurement  endpoint  is  a  measurable  ecological  characteristic  that  is  related  to  the  as¬ 
sessment  endpoint.  Because  measurement  endpoints  are  the  bases  for  structuring  the 
analysis  phase  of  the  sediment  risk  assessment,  and  because  they  will  ultimately  be  used 
to  estimate  risk,  they  should  have  certain  features; 

•  They  should  be  related  explicitly,  either  directly  or  indirectly,  to  specific 
assessment  endpoints. 

•  They  should  include  metrics  that  can  be  used  for  estimating  risks;  these  metrics 
incorporate  both  test  or  study  “endpoints”  {e.g.,  toxicity  results,  tissue  levels, 
and  community  structure)  with  nonsubjective  scaler  functions  or  values  that 
will  be  used  to  judge  the  response  at  the  risk  characterization  stage  {le.,  the 
response  variable). 
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One  or  more  of  the  following  scales  might  be  included  as  metrics  for  the  measurement 
endpoint: 

•  A  change  or  difference  in  the  response  variable  that  is  considered  potentially 
ecologically  relevant  {e.g.,  degree  of  mortality  in  a  test,  or  change  in  abundance 
or  biomass) 

•  Spatial  scale  of  the  change  or  difference  as  related  to  the  assessment  endpoint 
{e.g.,  hectares,  fraction  of  foraging  area,  fraction  of  area  utilized  by  a  local 
population) 

•  Temporal  scale  of  the  change  or  difference  as  related  to  the  assessment  endpoint 
(duration;  changes  with  time,  with  and  without  major  events  such  as  storms  or 
floods;  rate  of  recovery) 

The  measurement  endpoint  might  also  include  some  statement  of  statistical  confidence, 
although  this  is  not  always  possible. 

A  hypothetical  example  of  the  relationships  among  an  assessment  endpoint,  measure¬ 
ment  endpoints,  and  measurement  tools/results  is  given  in  Table  7-3.  The  statements  and 
values  used  in  the  assessment  and  measurement  endpoint  descriptions  are  used  only  for 
illustration  and  do  not  reflect  any  scientific  or  managerial  consensus.  In  a  real-world  situ¬ 
ation,  careful  consideration  would  be  given  to  how  the  measurement  endpoints  should 
be  developed  and  stated. 

7.2.4.2  Selecting  measurement  endpoints 

In  selecting  measurement  endpoints  for  evaluating  contaminated  sediments  at  sites,  a 
number  of  issues  should  be  addressed,  including  the  following: 

1)  The  relationship  between  the  assessment  and  measurement  endpoint 

a)  the  degree  of  association  between  measurement  and  assessment  endpoints 
can  vary  from  direct  to  indirect,  more  confidence  is  given  to  measurement 
endpoints  that  are  more  closely  related  to  or  provide  an  indication  of  re¬ 
sponse  tied  to  the  assessment  endpoint  (for  instance,  exposure  mechanism 
and  mechanism  of  toxicity) 

b)  the  degree  to  which  the  measurement  endpoint  can  be  linked  to  the  chemical 
stressors  of  concern  in  a  stress-response  relationship 

c)  the  availability  of  objective  criteria  forjudging  or  evaluating  the  ecological 
relevance  of  changes  in  the  measurement  endpoint 

2)  The  quality  of  the  data  that  can  be  developed  for  the  measurement  endpoint 
a)  the  need  to  minimize  both  Type  I  and  Type  II  errors  in  the  selected  measure¬ 
ment  endpoint 

3)  Study  design  considerations  for  measurement  endpoint  selection 

a)  the  extent  to  which  the  measurement  endpoint  (the  test  as  well  as  the  scaler 
components)  relies  upon  site-specific  information 
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Table  7-3  Example  assessment  and  measurement  endpoints 


Measurement  tools  and  test 

Assessment  endpoint  Measurement  endpoints^  endpoints  (results) 


Maintenance  of  keystone  •  Abundance  and  biomass  of  the 
benthic  amphipod  species  species  as  compared  to  that  at 
important  in  diet  of  local  appropriate  reference  areas;  a 

groundfish  “reduction”  or  difference  of _ % 

that  appears  related  to  presence  of 
contaminants  is  considered 
ecologically  relevant  at  spatial 

scales  of _ to _ hectares  for 

a  period  of _ years;  desired 

level  of  statistical  confidence  is  0.10 


Benthic  community  studies  at  the 
site  and  in  one  or  more  reference 
areas;  results  include  species 
identifications,  enumeration,  and 
biomass 


•  Mortality  of  amphipods  in  a  10-d  •  10-d  toxicity  test  with  amphipod; 

whole-sediment  test;  mortality  results  expressed  as  %  mortality 

>25%  related  to  presence  of 
contaminants  is  considered 
potentially  ecologically  relevant  at 
spatial  scales  of _ hectares 

^  The  specificity  of  this  example  is  intended  to  be  illustrative,  not  constrictive. 


b)  the  sensitivity  of  the  measurement  endpoint  for  detecting  changes  or  effects 
of  interest;  involves  a  consideration  of  test  variability  and  natural  variability 

c)  the  spatial  representativeness  of  the  measurement  endpoint  relative  to  the 
assessment  endpoint 

d)  the  temporal  representativeness  of  the  measurement  endpoint  relative  to  the 
assessment  endpoint 

e)  the  ability  of  the  measurement  endpoint  to  provide  quantitative  information 
(measurement  endpoints  can  be  qualitative) 

f )  the  extent  to  which  the  measurement  endpoint  is  based  on  standard  methods 
(Measurement  endpoints  can  include  procedures  that  are  in  the  developmen¬ 
tal  or  research  arena  but  there  may  be  less  confidence  in  such  methods.) 

g)  the  need  for  multiple  measurement  endpoints 
4)  Other  factors 

a)  site  schedule/phasing:  Overall  schedules  for  the  site  investigation  and 

feasibility  study  may  dictate  the  types  of  measurements  that  can  be  made  and 
when  they  can  be  made;  external  schedule  constraints  must  be  considered 
when  planning  the  SERA  and  selecting  measurement  endpoints  for  which  site 
data  are  desired;  in  some  cases,  there  may  not  be  an  opportunity  to  conduct 
the  assessment  in  phases,  and  the  risk  assessor  may  need  to  select  more 
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measurement  endpoints  up  front  than  might  be  the  case  if  the  assessment 
were  phased. 

b)  resources:  Measurement  endpoints  vary  in  implementation  costs;  assess¬ 
ments  are  often  resource  constrained,  and  in  such  cases,  measurement 
endpoints  must  be  selected  carefully  to  insure  the  most  useful  information  for 
the  available  budgets;  measurement  endpoints  should  also  be  appropriate  for 
the  nature  and  size  of  the  problem. 

c)  imminent  hazard  evaluations:  Relative  to  severity  of  impact  or  potential 
impact,  at  some  sites  there  may  be  concern  regarding  imminent  hazards;  in 
such  cases,  some  measurement  endpoints  may  be  selected  to  determine 
whether  such  a  hazard  exists;  these  endpoints  typically  permit  rapid  collec¬ 
tion  and  evaluation  of  data. 

d)  physical  limitations:  Hard-bottom  streams,  e.g.,  obviate  sediment  toxicity 
tests. 

7.2.5  Study  design 

The  study  design  incorporates  the  selected  measurement  endpoint  tools  and  the  site  con¬ 
ceptual  model  within  the  data  rigor  needed  for  extrapolation  from  the  measurement  end¬ 
points  to  the  assessment  endpoints.  As  such,  it  is  basically  the  plan  for  the  collection  of 
field  samples  which  serves  to  generate  the  measurement  endpoints.  In  addition,  study 
design  incorporates  elements  of  the  analysis  phase,  in  particular  the  risk  characteriza¬ 
tion. 

The  number  of  sample  locations  and  number  of  replicate  samples  which  must  be  taken 
to  answer  the  questions  developed  through  the  problem  formulation  phase  are  specified 
in  the  study  design.  This  inherently  requires  an  evaluation  of  statistical  and  logistical  is¬ 
sues,  for  example.  Can  replicate  samples  really  be  obtained?  or  How  many  observations 
will  be  require  to  meet  the  data  requirements?  Consultation  with  an  experienced  biostat¬ 
istician  is  highly  recommended,  particularly  to  determine  the  necessary  statistical  power 
to  resolve  differences  between  sampling  locations  or  exposure  levels.  Note  that  one  of  the 
major  benefits  of  involving  a  biostatistician  in  the  early  stages  of  study  design  is  that  cer¬ 
tain  measurements  may  be  shown  to  be  worthless  due  to  the  excessively  high  number  of 
samples  required  to  show  a  difference,  while  others  may  be  shown  to  be  highly  effective 
and  worthy  of  substantive  additional  effort  (Chapter  18) . 

If,  for  instance,  it  is  determined  that  the  number  of  samples  required  to  answer  a  specific 
question  is  not  obtainable,  either  due  to  physical  sampling  limitations  or  resource  limi¬ 
tations,  then  the  measurement  endpoint  and  the  measurement  endpoint  tools  must  be 
reevaluated.  Collection  of  samples  which  are  insufficient  to  answer  the  questions  arising 
from  problem  formulation  is  wasted  effort,  since  the  data  will. not  be  interpretable  in  the 
manner  required  to  reach  a  management  decision.  If  sample  size  alterations  occur  due  to 
field  conditions,  the  risk  manager  must  be  consulted  since  failure  to  collect  sufficient  data 
may  translate  into  an  inability  to  obtain  a  measurement  endpoint  and  hence  can  affect 
the  evaluation  of  the  assessment  endpoint. 
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Either  the  sampling  locations  or  the  mechanism  by  which  sampling  locations  will  be 
identified  is  specified  in  the  study  design.  For  example,  if  a  study  requires  samples  to  be 
collected  along  a  concentration  gradient  to  develop  an  exposure-response  curve  and  de¬ 
tailed  contaminant  distribution  data  exist,  specific  locations  could  be  targets  for  sample 
collection.  Alternatively,  if  little  information  exists  on  contaminant  distribution  but  there 
is  a  field  screening  technique  for  the  contaminant  of  interest  or  a  co-distributed  material, 
the  study  design  could  specify  the  concentrations  required  such  that  the  sampling  loca¬ 
tions  can  then  be  determiner!  in  the  field. 

It  may  be  advantageous  to  develop  an  exposure-response  relationship  in  association  with 
the  measurement  endpoint  tools.  While  it  is  frequently  difficult  at  best  to  determine  the 
actual  dose  received  by  organisms,  it  may  be  possible  to  estimate  exposure  levels  (expo¬ 
sure  point  concentrations)  which  can  then  be  related  to  measurement  tool  observations. 
For  example,  if  the  measurement  tool  being  employed  is  the  tissue  concentration  of  resi¬ 
dent  species,  then  representative  media,  sediment,  or  water  samples  could  be  collected 
within  the  home  range  of  the  organisms  sampled  for  tissue  contaminant  levels.  This  can 
be  repeated  at  several  media  concentration  levels.  From  these  data,  an  exposure  point 
(media  concentration)  to  tissue  concentration  relationship  could  be  drawn. 

Sediment  sampling  requires  an  evaluation  of  the  depth  of  sediment  to  be  collected,  which 
may  vary  based  upon  the  assessment  endpoints  and  thereby  the  measurement  end¬ 
points.  For  example,  if  the  measurement  endpoint  is  the  laboratory  toxicity  associated 
with  the  aerobic  sediment  at  the  sediment-water  interface,  then  only  the  uppermost  layer 
of  the  sediment  should  be  collected.  Alternately,  if  the  measurement  endpoint  is  the  labo¬ 
ratory  toxicity  of  the  biologically  active  layer  of  the  sediment,  then  the  upper  10  cm  may 
be  the  appropriate  sampling  depth.  Knowledge  of  the  contaminant  distribution  in  the 
sediment  should  always  be  incorporated  into  the  decision  of  the  appropriate  sampling 
depth.  For  example,  if  information  indicates  that  the  contaminated  portion  of  the  sedi¬ 
ments  is  below  5  cm,  then  inclusion  of  the  upper  5  cm  in  the  sample  could  dilute  the 
sample,  potentially  resulting  in  erroneous  conclusions. 

Synoptic  sampling  is  required  to  provide  a  temporal  link  between  all  data  collected.  Syn¬ 
optic  sampling  can  also  provide  for  cost  savings,  in  that  data  can  be  used  for  multiple 
purposes.  For  most  media  {e.g.,  sediment),  chemical  data  generated  can  be  used  for  all 
other  tools  being  employed. 

The  study  design  must  be  critically  evaluated  to  ensure  that  data  generated  will  fully  an¬ 
swer  all  of  the  specific  questions  developed  through  problem  formulation.  In  particular, 
the  measurement  endpoint  tools  must  be  relevant  to  the  concerns  associated  with  the 
assessment  endpoint. 

7.3  Analysis  phase 

7.3.1  Exposure  assessment 

The  goal  of  any  exposure  assessment  is  to  estimate,  to  the  degree  of  accuracy  needed,  the 
zone  of  influence  of  potential  concern  in  the  environment  or  the  bioavailability  of  chemi- 
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cals  to  organisms.  Exposure  assessments  include  analysis  of  the  magnitude,  duration, 
and  frequency  of  exposure,  based  on  data  for  1)  chemical  sources,  2)  chemical  distribu¬ 
tions  in  media  including  transformations,  and  3)  spatial-temporal  distributions  of  key 
receptors.  Available  data  are  used  initially  for  an  exposure  assessment,  and  focused  stud¬ 
ies  may  be  conducted  to  collect  data  for  subsequent  tier  assessments.  A  variety  of  meth¬ 
ods  or  tools  exist  to  allow  the  assessor  to  estimate  potential  exposures.  These  tools  each 
have  varying  degrees  of  accuracy,  ecological  relevance,  and  inherent  uncertainty  (Chap¬ 
ter  18).  For  example,  a  mechanistic  model  can  provide  estimates  of  water  concentrations; 
however,  its  accuracy  depends  on  the  inherent  understanding  of  the  environmental  pro¬ 
cesses  on  which  the  model  is  based  and  the  validity  of  the  assumptions  used  in  the  model 
scenario.  Alternately,  empirical  measurement  of  the  chemicals  of  concern  at  a  given  site 
is  perhaps  a  less  uncertain  method  for  evaluating  exposure,  because  it  permits  the  direct 
evaluation  of  exposure  at  a  given  location.  However,  it  has  the  drawback  of  being  labor 
intensive  and  costly,  and  bulk  concentration  does  not  directly  translate  to  exposure. 

The  evaluation  of  exposure  is  perhaps  the  most  unique,  and  the  most  uncertain,  aspect  of 
conducting  ERAs  for  contaminated  sediments.  A  wealth  of  information  is  available  sug¬ 
gesting  that  evaluation  of  contaminant  concentrations  in  whole  sediments  may  not  be 
sufficient  to  address  the  question  of  bioavailability  and,  thus,  risk  of  adverse  effects.  This 
is  reflected  in  recent  regulatory  strategies  for  developing  sediment  criteria  for  both  non¬ 
polar  organic  materials  (USEPA 1993)  and  metals  (USEPA  1994b).  In  both  cases,  empiri¬ 
cal  data  suggest  that  porewater  contaminant  concentrations  more  accurately  predict 
observed  toxicity  and  community-level  effects  than  do  whole  sediment  concentrations. 
This  is  chiefly  due  to  mitigating  or  modifying  factors  {e.g.,  OC,  AVS)  that  affect  the  bio¬ 
availability  of  sediment-associated  contaminants.  On  the  other  hand,  depending  on  the 
degree  of  rigor  necessary  {i.e.,  the  tier  of  the  assessment),  the  use  of  less  rigorous,  more 
uncertain  methods  may  be  all  that  is  needed. 

A  variety  of  tools  are  available  for  estimating  exposures  in  sediments.  These  tools  and  a 
brief  description  of  their  advantages  and  disadvantages  are  provided  in  Table  7-4. 

The  decision  to  use  a  particular  tool  in  assessing  exposure  must  be  based  on  a  number  of 
factors,  including  these: 

•  Type  of  contaminants  present 

•  Inherent  effects  or  concerns  associated  with  the  contaminants-of-concern  {e.g. , 
direct  acting  toxicants  versus  bioconcentratable  materials) 

•  Level  of  assessment  {i.e. ,  screening-level  assessment  versus  advanced  tier) 

•  Quantity  and  quality  of  available  data 

In  most  instances,  a  battery  of  tools  must  be  employed  in  conducting  the  assessment; 
however,  the  degree  to  which  any  or  all  tools  will  be  used  may  vary. 

7.3.2  Effects  assessment 

The  goal  of  the  effects  assessment  is  to  provide  information  on  toxicity  or  other  effects 
that  form  integral  components  of  the  measurement  endpoint.  Effects  assessment  tools 
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Table  7-4  Exposure  assessment  tools 

Exposure  assessment 

method  Advantages  Limitations 

Empirical  estimates 

Whole  sediment  Extremely  valuable  for  retrospective  Requires  that  assessor  dictate  chemicals 

concentrations  screening-level  assessment.  Permits  one  of  of  concern  up  front  and  requires  that  a 
most  direct  procedures  for  determining  sampling  design  of  sufficient  statistical 

distribution  and  quantification  of  rigor  exist.  Can  be  costly;  a  priori 

sediment  contaminante.  Valuable  for  information  should  be  used  to  quantify 
evaluation  of  bioaccumulative  number  of  samples  needed, 

contaminants. 

Porewater  May  provide  more  direct  estimation  of  Requires  more  difficult  sampling 

concentrations  actual  exposure  than  whole  sediment  procedures  that  can  be  limited  in  certain 

concentrations.  types  of  sediments  and  can  be  more 

costly. 

Overlying  water  Can  prove  valuable  for  retrospective  May  not  be  appropriate  for  all  types  of 

concentrations  screening-level  assessments,  especially,  in  sediments.  Water  flow  rates  may  result 

cases  where  sediments  are  “coarse  in  dilution  to  levels  below  detection 

grained”,  Le.,  overlying  waters  may  limits, 

appreciably  affect  porewater 
concentrations  or  in  those  cases  where  it 
is  anticipated  that  sediments  may  serve  as 
a  “source”  for  water  contamination. 

SEM  Provides  most  direct  empirically  derived  Requires  difficult  analytical  procedures, 

concentrations  estimation  of  actual  exposure  for  metals  Samples  must  be  obtained  using 

in  contaminated  sediments.  Only  rigorous  procedures  and  can  be  costly, 

applicable  in  those  cases  where  metals  are  Relates  only  to  divalent  metals.  See  pore 
a  contaminant  of  concern.  See  pore  water,  water. 

Tissue  Can  be  most  direct  method  for  estimating  Can  be  difficult  or  impossible  for  species 

concentrations  tissue  bioaccumulation.  Will  allow  that  are  difficult  to  capture  or  cannot  be 

separation  of  tissue  concentrations  to  practically  obtained.  Generally,  sample 

edible  and  inedible  tissues.  Permits  numbers  are  small  and  can  limit 

species-specific  evaluation  of  tissue  statistical  rigor.  Prediction  of  adverse 

concentrations  and  allows  direct  effects  from  these  data  must  have  its 

evaluation  of  food  chain  transfer.  basis  in  controlled  experiments  where  a 

clear  exposure-response  relationship  has 
been  established;  few  such  data  currently 
exist. 

Biomarkers  Can  be  useful  in  addressing  question  of  Generally,  nonspecific  for  contaminants. 

exposure  and  bioavailability.  Can  be  May  not  provide  an  exposure-response 

extremely  sensitive.  relationship.  Requires  understanding  of 

mechanisms  and  must  be  related  to  a 
quantifiable  adverse  effect. 
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Table  7-4  continued 


Exposure  assessment 
method 

Advantages 

Limitations 

Modeled  /  calculated  estimates 

TOC/  AVS-  Normalization  of  whole  or  SEM  sediment 

corrected  concentrations  on  the  basis  of  TOC  or 

concentrations  AVS  can  be  useful  in  providing  sediment- 
specific  estimate  of  bioavailability  and 
receptor  exposure. 

There  is  error  in  these  estimates. 

Fate  and  transfer 

mechanistic 

models 

Can  be  important  tools  in  conducting 
predictive  ERAs,  May  provide  one  of  the 
few  tools  available  for  estimating  spatial 
or  temporal  shifts  in  contaminants. 

Require  accurate  understanding  of 
various  fate-and-transport  processes  that 
affect  contaminant  concentrations. 
Outputs  are  only  as  accurate  as 
assumptions  used  to  derive  them. 
Accuracy  of  modeled  results  may  be 
directly  correlated  with  quantity  and 
quality  of  available  data. 

Fate  and  transfer/ 

probabilistic 

models 

Can  be  important  tools  in  conducting 
predictive  ERAs  and  may  provide  one  of 
few  tools  available  for  estimating  spatial 
or  temporal  shifts  in  contaminants. 

When  sufficient  data  are  available,  can 
provide  powerful  tool  based  on  site- 
specific  empirical  data,  reducing 
uncertainty  associated  with  estimation  of 
contaminant  concentrations  and  spatial 
distribution. 

Accuracy  is  directly  related  to  available 
data  and  may  require  great  deal  of  data 
to  be  very  robust.  Predictability  often 
depends  on  quality  of  extant  data. 

Food  chain 
models 

Provide  only  method  for  estimating  tissue 
concentrations  in  certain  types  of  single 
organisms,  communities,  or  trophic 
groups.  Are  most  accurate  when  rooted  in 
empirical  data  but  can  provide 
estimations  of  bioaccumulation  or 
biomagnification  based  on  conservative 
assumptions.  In  cases  where  it  is  not 
practical  or  possible  to  directly  measure 
tissue  concentrations  in  species  (e.g. , 
threatened  or  endangered  species),  these 
models  are  only  way  to  estimate 
concentrations. 

As  with  all  models,  a  high  degree  of 
mechanistic  understanding  or  empirical 
data  will  improve  model  predictions. 
Prediction  of  adverse  effects  from  these 
data  must  have  its  basis  in  controlled 
experiments  where  a  clear  exposure- 
response  relationship  has  been 
established.  Few  data  currently  exist  that 
would  provide  this  type  of  relationship. 
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should  be  appropriate  for  the  contaminants  and  assessment  and  measurement  end¬ 
points  under  consideration.  As  described  below,  tools  range  from  numerical  sediment 
criteria  or  values  to  exposure-response  curves  or  relationships. 

Effects  assessment  tools  employed  at  earlier  tiers  or  phases  are  usually  simpler  than  for 
those  that  may  be  used  at  later  stages.  Various  tools  exist  to  allow  the  assessor  to  estimate 
potential  effects.  As  with  exposure  assessment,  these  have  varying  degrees  of  accuracy, 
ecological  relevance,  and  inherent  uncertainty  (Chapter  18). 

Effects  assessment  tools  and  a  brief  description  of  their  advantages  and  disadvantages  are 
provided  in  Table  7-5.  Biomarkers,  with  the  exception  of  histopathology  studies,  are  not 
included  because  they  are  considered  measures  of  exposure,  not  effects. 

In  general,  the  following  tools  are  used  to  evaluate  the  effects  of  sediments  on  benthic 
invertebrates: 

•  Benchmark  values  (numerical  criteria) 

•  Toxicity  studies 

•  Field  studies  of  abundance  and  community  structure 

The  following  tools  are  used  to  evaluate  effects  on  fish: 

•  Benchmark  values  (numerical  criteria) 

•  Bioaccumulation  measurements  or  estimates 

•  Toxicity  studies 

•  Field  studies  (abundance  and  community  structure,  possibly  also  histopathol- 

ogy) 

The  following  tools  are  used  to  evaluate  effects  on  birds  and  mammals: 

•  Bioaccumulation  measurements  or  estimates  in  prey  species 

•  Literature  values  on  toxic  effects 

•  Field  studies  of  effects  (environmental  epidemiology) 

7.4  Risk  characterization 

Risk  characterization  for  hazardous  waste  sites  should  not  be  done  in  isolation  but  rather 
should  consider  other  compartments  that  may  be  part  of  the  environment  as  a  whole. 
Specifically,  consideration  should  be  given  to  the  environment  outside  the  boundaries  of 
the  waste  site  and  to  potential  cumulative  impacts  from  other  sources  or  effects  related  to 
ambient  background  concentrations  of  contaminants  which  may  not  be  attributable  di¬ 
rectly  to  the  waste  site.  Risk  characterization  involves  bringing  together  exposure  and 
exposure-response  information  to  estimate  the  extent  and  nature  of  the  risk  at  a  contami¬ 
nated  sediment  site  and  the  level  of  uncertainty  associated  with  the  estimate  of  risk.  The 
study  design  (Section  7.2.5)  and  the  assessment  and  measurement  endpoints  (Sections 
7.2.3  and  7.2.4)  will  determine  the  nature  of  the  risk  characterization  and  the  level  of  un¬ 
certainty  associated  with  it  (Chapter  18).  There  are  several  key  considerations  that 
must  be  incorporated  into  the  risk  characterization  for  effective  management  decisions 
to  be  made  in  remediating  contaminated  sediments. 
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Effects  assessment 
method 


Empirical  estimates 

Benchmarks  (e.g. , 
sediment  contaminant 
numbers) 


Toxicity  tests 
(experimental) 

•  laboratory 

•  field 


Bioaccumulation  tests 
(experimental) 

•  laboratory 
•field 


Advantages 


Limitations 


Provide  simple  approaches  fi:om  Technical  bases  of  values  are  often  not  well 

comparing  sediment  concentrations  established  and  not  included  as  part  of 
(measures  of  exposure)  to  reference  pubUshing  the  numbers.  Makes  it  difficult 

values  (surrogate  measures  of  potential  to  determine  if  values  are  appropriate  for  a 
effects).  Considerable  use  of  this  particular  assessment  endpoint.  Most 

approach  around  the  world;  results  are  numbers  are  based  on  either  background 


easy  to  communicate  to  risk  managers. 
Also  requires  limited  evaluation, 
usually  involving  measures  of  chemical 
concentrations.  Employed  for 
measurement  of  both  bulk  chemicals 
and  bioavailable  fractions  of 
chemicals. 


Primary  advantage  in  lab  or  field  is 
that  it  is  possible  to  control  or  account 
for  many  variables  that  may  exist  in 
the  field  and  to  focus  on  a  specific 
attribute  of  the  sediment,  i.e.,  its 
toxicity.  Information  is  quantitative 
and  can  be  interpreted  within  context 
of  what  is  known  about  toxicity  of 
other  sediments  or  chemicals  to 
organism  used  in  test. 


values  or  protection  of  benthic 
invertebrates  and  may  not  be  appropriate 
for  other  assessment  endpoints.  Numbers 
based  on  bulk  chemical  measurements  also 
do  not  consider  effects  of  matrix  on 
bioavailabiUty.  Even  values  that  explicitly 
consider  certain  aspects  of  bioavailability 
(c.^.,  EqP)  may  miss  others  (e.g.,  physical 
sequestration). 

Major  disadvantage  is  that  they  are  a 
model  of  the  real  system  and  do  not 
include  many  factors  that  affect 
populations.  Use  of  toxicity  tests 
introduces  uncertainties  related  to 
extrapolating  results.  Potential 
disadvantage  is  that  toxicity  tests  typically 
consider  one  or  a  few  life  stages  of  a  few 
species. 


Bioaccumulation  is  used  both  to 
estimate  exposure  and  effects.  Direct 
measurements  of  bioaccumulation 
provide  information  on  potential 
effects  where  the  measurement 
endpoint  of  concern  is  a 
concentration. 


May  not  be  specific  to  particular  target 
organ.  Uncertainty  exists  concerning 
relationships  between  tissue  levels  and 
effects.  In  some  cases,  effects  are  inferred 
from  tissue  levels.  In  such  cases,  there  may 
be  uncertainties  associated  with 
relationships  among  tissue  levels  and 
effects.  Bioaccumulation  is  a  phenomenon, 
not  an  effect  and  is  primarily  an  effect  tool, 
except  where  there  is  a  target  level  that  is 
actually  the  measurement  endpoint. 
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Table  7-5  continued 
Effects  assessment 

method  Advantages  Limitations 

Field  studies  of  Primary  advantage  is  that  they  provide  Major  limitation  is  occasional  difficulty  in 

communities  and  direct  information  on  populations  and  detecting  ecologically  important  changes 

populations  communities  of  interest.  from  natural  variability.  In  addition,  it  is 

sometimes  difficult  to  sort  out  patterns 
related  to  contaminants  as  compared  to 
other  environmental  variables. 

Histopathology  studies  Serve  as  indicators  of  effects  in  certain  Population  significance  not  well 

species.  Effects  can  be  measured  understood, 
directly  and  quantified  in  terms  of 
frequency  of  incidence. 

Development  of  Provide  basis  for  evaluating  Unless  relationships  have  been  developed 

exposure-response  implications  of  exposure  for  species  at  site  of  interest,  adjustments 

relationships  concentrations  to  receptors  of  interest,  are  typically  made  to  account  for 

[applicable  to  all  above]  interspecies  extrapolations.  Can  be 

considerable  uncertainty  associated  with 
these  adjustment  factors. 

^  All  tools  liave  varying  degrees  of  difficulty  in  determining  causality. 


7.4.1  Multiple  lines  of  evidence 

In  sediment  assessment,  multiple  lines  of  evidence  are  typically  available  for  the  final  risk 
characterization  (e.g.,  results  of  toxicity  tests,  benthic  community  surveys);  thus,  a 
weight-of-evidence  approach  is  used  to  evaluate  and  characterize  risk.  The  study  design 
will  normally  prescribe  the  lines  of  evidence  required  to  support  evaluation  of  risk  to  a 
particular  assessment  endpoint,  though  the  process  may  be  tiered  or  iterative  (Section 
7.1). 

Characterization  of  risk  relative  to  a  broad  assessment  endpoint  (e.g.,  a  healthy  benthic 
community  which  poses  no  risk  to  wildlife  consumers)  and  the  associated  confidence  in 
the  estimate  may  be  expressed  in  simple  narrative  terms  such  as  low,  moderate,  or  high 
risk  to  the  benthic  community.  Such  statements  represent  the  integration  and  interpre¬ 
tation  of  multiple  lines  of  evidence  which  are  supported  by  various  mathematical  and 
statistical  analyses.  As  emphasized  by  Suter  (1993),  “...each  risk  estimate  will  have  its 
own  assumptions  and  associated  uncertainties.  The  separate  lines  of  evidence  must  be 
evaluated,  organized  in  some  coherent  fashion,  and  explained  to  the  risk  manager  so  that 
a  weight-of-evidence  evaluation  can  be  made.”  A  weight-of-evidence  approach  is  de¬ 
scribed  more  fully  below. 

Though  measurement  endpoints  can  be  directly  characterized  in  terms  of  risk,  the  goal 
of  the  risk  characterization  is  to  estimate  risk  relative  to  the  assessment  endpoints.  Mea¬ 
surement  endpoints  must  be  clearly  linked  back  to  the  original  assessment  endpoints 
when  interpreting  and  communicating  results  of  the  risk  characterization. 
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The  weigh t-of-evidence  approach  requires  best  professional  judgment  and  is  the  process 
by  which  measurement  endpoints  are  related  to  assessment  endpoints  to  evaluate  if  there 
is  a  significant  risk  of  harm  to  the  environment.  The  approach  is  planned  and  initiated  in 
the  problem  formulation  stage,  and  results  are  integrated  at  the  risk  characterization 
stage  (Menzie  et  al  1996). 

In  using  a  weight-of-evidence  approach,  there  may  be  competing  or  contradictory  lines  of 
evidence  and  potential  for  inconsistency  in  interpretation  of  the  significance  of  results. 
Thus,  a  coherent  framework  is  needed  for  evaluating  and  weighting  measurement  end¬ 
points  in  terms  of  their  importance  in  the  final  risk  characterization.  For  example,  in  sedi¬ 
ment  assessment  using  a  triad  approach  (Chapman  1990, 1996),  community  indices, 
analytical  chemistry  data,  and  results  of  toxicity  tests  may  not  all  similarly  indicate  risk 
to  the  benthic  community.  Weighting  of  measurement  endpoints  based  on  the  factors 
described  below  will  focus  decisions  and  risk  characterization. 

7.4. 1. 1  Confidence  in  measurement  endpoints 

In  interpreting  measurement  endpoints  as  part  of  the  overall  risk  characterization,  more 
confidence  may  be  placed  on  nonsubjective  endpoints  (e.g.,  lethality,  growth,  or  repro¬ 
duction)  where  the  nature  of  the  adverse  effect  and  mechanistic  links  with  the  assessment 
endpoint  can  be  clearly  made,  than  on  a  measurement  endpoint  where  the  mechanistic 
link  with  the  assessment  endpoint  may  not  be  clearly  understood  (e.g.,  biomarkers,  his- 
topathology).  The  latter  may  be  used  to  corroborate  or  support  interpretation  of  risk  but 
would  have  less  weight  in  the  characterization.  Confidence  may  also  be  linked  to  statis¬ 
tical  uncertainty,  how  the  value  was  derived  (e.g.,  direct  observation,  modeled  with  de¬ 
fault  parameters).  Factors  affecting  confidence  in  measurement  endpoints  are  described 
in  Sections  7.2.4  and  in  Chapter  18. 

7.4.1. 2  Magnitude  of  response  relati}re  to  the  metric 

The  greater  the  magnitude  of  the  response,  the  more  confidence  or  weight  that  will  nor¬ 
mally  be  placed  on  it.  However,  the  magnitude  of  the  response  can  only  be  evaluated  rela¬ 
tive  to  an  accepted  metric  or  yardstick.  For  example,  judging  whether  a  20%  or  a  50% 
reduction  in  growth  is  a  “high”  level  of  effect  will  be  based  on  precedent  and  established 
metrics  for  that  endpoint. 

7.4.1. 3  Multiple  indications  of  effect 

The  greater  the  number  of  lines  of  evidence  supporting  a  particular  indication  of  risk,  the 
greater  the  confidence  in  that  endpoint  (e.g.,  low  confidence  may  be  placed  on  a  line  of 
evidence  or  measurement  endpoint  which  is  not  supported  by  any  other  evidence).  Ex¬ 
treme  caution  should  be  exercised  in  interpreting  such  results  where  the  outlier  is  indica¬ 
tive  of  an  adverse  effect  and  fits  either  or  both  of  the  above  considerations.  Such  a  result 
may  point  to  a  need  for  further  investigation. 

Ideally,  a  weight-of-evidence  approach  will  express  not  only  the  absolute  value  of  the  mea¬ 
surement  endpoint,  but  the  degree  of  confidence  (weighting)  in  the  endpoint  based  on 
the  above  considerations.  Results  may  be  descriptively  presented  in  tabular  or  graphical 
form  to  aid  in  interpretation. 
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7.4.2  Retrospective  versus  prospective  risk  characterization 

The  information  needed  to  reach  a  management  decision  and  consideration  of  remedia¬ 
tion  options  may  require  both  a  retrospective  risk  characterization  (What  is  the  existing 
risk  of  harm  to  the  environment?)  and  a  prospective  risk  characterization  (Given  the 
probability  of  certain  events  as  a  result  of  a  remedial  alternative  —  e.g.,  mobilization  of 
contaminated  sediments  as  a  result  of  dredging  activities,  biological  changes  such  as 
methylation  of  mercury — what  is  the  prospective  risk  of  harm  to  the  environment?).  For 
example,  a  prospective  evaluation  of  risk  is  important  for  buried  sediments  which  may  be 
exposed  through  a  storm  event  or  other  perturbation.  A  “snapshot”  retrospective  assess¬ 
ment  may  indicate  low  risk,  whereas  a  prospective  assessment  may  indicate  high  prob¬ 
ability  of  risk  and  lead  to  a  different  management  decision.  Other  prospective 
considerations  can  include  transport  and  redistribution  of  contaminated  sediment  off¬ 
site,  seasonal  pulse  events  (lake  bottom  turnover),  etc.  Recommendation  of  appropriate 
and  technically  sound  models  for  such  prospective  analyses  is  part  of  the  problem  formu¬ 
lation  and  study  design  phase. 

A  further  consideration  is  prospective  evaluation  of  remedial  options  which  could  involve 
modeling.  Whatever  the  means,  there  will  be  uncertainty  which  needs  to  be  quantified 
(Chapter  18). 

7.4.3  Risk  estimation  methods 

There  is  a  rapidly  expanding  toolbox  for  risk  estimation  methods  amenable  to  sediment 
assessment.  However,  it  is  important  to  remember  that  these  are  tools  to  characterize  or 
interpret  effects  and  exposure  data  in  terms  of  risk. 

Primary  tools  for  estimating  risk  and  their  application  include  these: 

•  The  quotient  method  for  assessments  (described  briefly  below) 

•  J oint  probability  analysis  to  estimate  risk  (described  briefly  below) 

•  Ecological  models  to  extrapolate  measurement  endpoints  to  assessment 
endpoints,  with  model  uncertainty  analysis  (e.g.,  Monte  Carlo)  for  risk  charac¬ 
terization 

•  Chemical  analyses  of  sediment  and  water  in  combination  with  toxicity  testing 
and  community  analysis  for  assessing  existing  impacts  to  sediment  communi¬ 
ties  and  for  developing  field-based  exposure-response  relationships  that  can  be 
used  for  risk  predictions 

•  Quantification  of  uncertainty  estimates  by  deriving  probability  functions  (or 
ranges  where  data  are  limited  in  screening  assessments)  for  risk  estimates 

7.4.3. 1  Quotient  method 

The  quotient  method  is  frequently  used  for  a  screening  or  preliminary  risk  calculation  for 
contaminated  sediments.  By  itself,  the  quotient  method  does  not  provide  a  complete 
characterization  of  the  magnitude  of  risk  and  uncertainties.  Such  approaches  provide  an 
approximate  risk  index  and  the  confidence/uncertainty  associated  with  this  expression 
of  risk  rests  largely  in  the  confidence/uncertainty  in  the  measurement  endpoints  that 
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were  entered  into  the  calculation.  Hazard  quotients  are  used  to  compare  single  effect  and 
exposure  values  and  can  be  used  whenever  there  is  sufficient  information  to  estimate  the 
expected  environmental  concentration  in  the  media  of  concern,  or  when  it  has  been 
measured,  and  where  there  are  adequate  studies  in  the  literature  or  from  directed  labo¬ 
ratory  and  field  toxicity  tests  to  determine  a  toxicological  benchmark  or  threshold  con¬ 
centration  (such  as  the  no-observed-effects  concentration  [NOEC]  for  acute  or  chronic 
toxicity).  The  hazard  quotient  method  compares  the  estimated  or  measured  exposure 
levels  to  the  measured  or  predicted  threshold  value  for  effect.  Quotients  are  simple  to 
interpret  (/.c.,  quotients  less  than  1  typically  imply  low  risk,  and  quotients  greater  than  1 
imply  that  there  is  a  potential  risk).  However,  in  weighing  the  value  of  hazard  quotients 
in  the  overall  risk  characterization,  weaknesses  such  as  the  lack  of  indirect  effects  evalu¬ 
ation  or  of  incremental  dose  impacts  must  be  considered.  Hazard  quotients  may  be  par¬ 
ticularly  useful  for  determining  priority  contaminants  when  the  site  is  grossly 
contaminated  by  many  chemicals  (Chapter  17). 

In  deciding  what  threshold  or  benchmark  value  is  used  in  the  hazard  quotient,  it  is  im¬ 
portant  to  note  that,  though  standardized  toxicity  tests  often  have  prescribed  methods 
for  interpreting  results,  they  do  not  provide  information  for  interpreting  the  ecological 
significance  of  the  effect.  For  example,  a  15%  reduction  in  growth  may  be  statistically  dif¬ 
ferent  from  a  control  but  may  have  little  meaning  in  terms  of  actual  risk  to  field  popula¬ 
tions.  In  some  cases,  there  are  “rules  of  thumb”  (e.g.,  an  observed  20%  to  25%  mortality 
effect  in  laboratory  toxicity  studies  as  the  threshold  concentration).  Given  identical  mea¬ 
surement  endpoints  and  statistical  rigor,  characterization  of  risk  can  vary  widely  depen¬ 
dent  on  the  threshold  value  that  is  accepted  as  representing  an  ecologically  rather  than 
merely  a  statistically  significant  effect.  In  characterizing  risk,  it  is  important  to  agree,  at 
the  study  design  phase,  what  the  threshold  effect  level  will  be  (e.g.,  lowest-observed-effect 
concentration  [LOEC],  50%  effect)  to  avoid  inconsistent  interpretation  of  results. 

Comparability  of  risk  characterizations  is  enhanced  if  the  effect  level  refers  to  a  quantile 
such  as  LCIO  or  EC20  rather  than  NOEC,  maximum  acceptable  toxicant  concentration 
(MATC),  or  LOEC.  If  quantile  responses  are  routinely  determined,  it  is  relatively  simple 
to  proceed  to  the  next  level  of  complexity:  continuous  exposure-response  relationships. 
Similarly,  where  a  standardized  battery  of  tests  is  used,  the  problem  formulation  stage 
should  consider  and  prescribe  the  expectations  for  acceptable  risk  (e.g.,  all  tests  must 
“pass”). 

Though  traditionally  used  with  toxicological  endpoints,  a  quotient  method  may  be  based 
on  virtually  any  of  the  quantified  effects-based  measurement  endpoints  identified  in  Sec¬ 
tion  7.2.4.I.  Endpoints  such  as  community  structure  indices  for  benthic  invertebrates 
can  be  compared  to  threshold  values  for  toxicity;  an  estimated  chemical  dose  to  a  wildlife 
receptor  (e.g.,  estimated  using  a  food  chain  model)  can  be  compared  to  a  toxic  dose  esti¬ 
mate  such  as  an  NOEC  or  LOEC. 

The  use  of  generic  effects-based  benchmark  concentrations  such  as  SQC  or  sediment 
quality  guideline  (SQG)  values  to  calculate  a  quotient  is  perhaps  the  simplest  form  of  risk 
characterization  (also  known  as  toxic  units  [TU]).  Though  this  approach  can  be  a  useful 
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coarse  screening  tool  that  minimizes  the  need  for  site-specific  toxicity  testing,  interpre¬ 
tation  must  be  made  in  light  of  the  limitations  of  generic  criteria  as  surrogates  for  site- 
specific  exposure-response  data.  The  typically  conservative  nature  of  most,  but  not  all 
criteria  does  minimize  false  negatives  in  evaluating  risk  at  a  site  and  this  approach  is  use¬ 
ful  in  focusing  the  efforts  of  further  study  to  priority  contaminants. 

Population  and  community  endpoints  must  be  interpreted  with  some  caution  if  charac¬ 
terized  using  a  quotient  method.  For  example,  for  data  from  population  surveys  to  be 
directly  relevant  to  the  exposure  at  a  site,  the  numbers  of  organisms  must  be  largely  con¬ 
trolled  by  survival  and  reproduction  within  the  site,  rather  than  by  immigration  and 
emigration. 

7.4«3.2  Probabilistic  risk  characterization 

There  is  a  growing  trend  towards  probabilistic  risk  characterization  in  ERA.  Some  com¬ 
monly  used  techniques  are  described  briefly  below.  Again,  the  data  required  to  drive 
these  methods  must  be  clearly  specified  in  the  study  design  if  such  approaches  are  to  be 
used  to  characterize  risk. 

7.4.3.3  Joint  probability  analyses 

This  approach  can  be  used  to  estimate  the  risk  of  chemical  concentrations  exceeding  tox¬ 
icity  thresholds,  including  sediment  quality  guidelines  for  a  given  time  period  in  a  given 
area  (Long  eta/.  1995).  A  cumulative  probability  distribution  of  chemical  concentrations 
is  developed,  and  then  the  cumulative  probability  distribution  is  used  to  determine  the 
risk  (probability)  of  exceeding  the  toxicity  threshold  (Chapter  18). 

7.4.3.4  Continuous  exposure-response  model 

Data  at  the  individual  or  population  level  are  used  to  generate  probabilistic  estimates  of 
risk.  The  use  of  population-level  models  in  risk  characterization  is  increasing,  especially 
Monte  Carlo  simulations  which  provide  a  single  estimate  of  risk  for  a  single  exposure 
concentration  based  on  a  series  of  model  simulations  using  randomly  selected  sets  of 
parameters.  The  entire  procedure  is  repeated  for  a  series  of  exposure  concentrations  and 
the  estimate  of  risk  can  be  plotted  as  a  function  of  the  exposure  concentration  (Bartell  et 
al  1992).  Standardized  computer  programs  are  available  to  conduct  Monte  Carlo  simu¬ 
lations,  but  these  generally  work  best  with  real  data.  Disadvantages  include  the  fact  that 
use  of  computer  simulations  may  overlook  the  implicit  assumptions  that  contribute  to 
the  uncertainty  of  the  ERA  and  that,  due  to  the  ease  of  running  the  model,  inappropriate 
use  and  interpretation  can  occur  among  practitioners  not  familiar  with  the  ecological 
basis  of  the  parameters  entered  into  the  simulation. 

7.4.4  Spatial  characterization  of  risk 

On-site  contamination  gradients  {e.g.,  depth,  breadth)  can  be  used  to  demonstrate  on-site 
exposure-response  functions.  Where  such  data  have  been  collected,  they  should  be  used 
as  part  of  the  risk  characterization.  The  paired  comparison/mapping  of  effects  data  from, 
for  example,  toxicity  testing  or  community  survey  data  with  environmental  concentra- 
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tion  (analytical  chemistry)  data  provides  an  important  spatial  dimension  to  risk  charac¬ 
terization  needed  in  final  management  decisions. 

7.4.5  Temporal  characterization  of  risk 

Risk  may  also  be  temporally  characterized  in  terms  of  seasonal  or  long-term  trends  to 
account  for  varying  exposures  and  changes  in  species  susceptibility  to  chemical  effects. 
The  use  of  chemical  data  from  different  times  and  locations  to  derive  estimates  of  expo¬ 
sure  concentrations  for  risk  characterization  must  be  matched  to  the  distribution,  life 
stages,  and  activities  of  the  selected  receptors  and  to  specifics  for  sediments  (e.g.,  turn¬ 
over,  storm  events,  bioturbation). 

7.4.6  Uncertainty  analysis 

Risk  characterization  must  include  the  uncertainty  in  the  risk  estimate  and  discuss  the 
way  in  which  it  might  affect  interpretation  and  decision-making.  Sources  of  uncertainty 
include  natural  variation,  missing  information,  and  error  associated  with  estimates  and 
extrapolations.  Uncertainty  associated  with  various  endpoints  is  discussed  in  Chapter  18. 

7.4.7  Other  considerations 

In  applying  ERA  to  sediment  cleanup  issues,  additional  factors  that  may  influence  man¬ 
agement  decisions  need  to  be  considered  in  characterizing  the  site. 

7.4.7.7  Comparison  to  a  reference  site 

Risk  may  be  characterized  relative  to  an  “uncontaminated”  reference  site.  Often,  the  ref¬ 
erence  site  does  not  represent  pristine  conditions  but  is  representative  of  both  the  ambi¬ 
ent  or  background  conditions  or  sediment  type  (grain  size,  TOC,  etc)  for  the  area  and 
historical  contamination.  Multiple  reference  stations  may  be  needed  to  cover  the  range 
of  conditions.  The  reference  site  is  often  not  a  perfect  control  and  will  add  to  the  uncer¬ 
tainty  of  the  analyses . 

7.4. 7.2  Characterization  of  sensitive  areas/speeies 

A  toxicologically  based  estimate  of  risk  may  lead  to  the  removal  or  perturbation  of  con¬ 
taminated  sediments  and  the  benthic  or  other  communities  they  support.  The  estimate 
of  risk  must  be  carefully  weighed  against  the  disturbance  to  sensitive  habitat/species  that 
can  result  from  any  remediation.  Consideration  of  such  effects  as  loss  of  habitat  and  re¬ 
colonization  times  of  benthic  communities  will  become  equally  important  in  decision¬ 
making  as  risk  assessment  that  focuses  on  the  effects  of  contaminants. 

7.5  Decision-making 

Decision-making  responsibility  rests  with  the  risk  manager  after  the  risk  assessor  pro¬ 
vides  an  objective  and  technically  defensible  evaluation  of  the  present  risks  and  of  the 
risks  associated  with  various  remedial  options.  The  first  steps  of  the  process  for  sediment 
risk  assessment  (see  Section  7.2)  should  be  educational,  where  risk  assessors  and  risk 
managers  achieve  a  common  understanding  of  the  management  objectives  and  the  na¬ 
ture  and  objectives  of  the  site-specific  assessment.  Risk  managers  need  to  understand 
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that  use  of  multiple  assessment  and  measurement  endpoints  and  a  combination  of  as¬ 
sessment  approaches  will  be  necessary  to  determine  existing  and  potential  ecological  risk 
at  a  site  with  contaminated  sediments.  Several  concepts  should  be  addressed  in  consul¬ 
tation  with  the  risk  assessor  before  the  decision-maker  or  risk  manager  initiates  prelimi¬ 
nary  screening,  selection  of  assessment  endpoints,  and  development  of  the  conceptual 
model: 

•  Ecological  risk  assessment  may  evaluate  one  or  many  stressors  and  ecological 
components  and  thus  must  be  flexible,  i.e.,  involve  prospective,  retrospective,  or 
a  combination  of  approaches. 

•  Field  studies  are  essential  for  reducing  uncertainty  in  exposure  and  effects 
assessment  and  for  validating  assumptions  or  predictions. 

•  A  weight-of-evidence  approach  is  recommended  for  use  for  risk  characteriza¬ 
tion,  along  with  explanations  of  uncertainty.  In  particular,  professional  judg¬ 
ment  is  key,  and  decision-making  must  involve  the  consensus  of  a  team  of 
chemists,  toxicologists,  ecologists,  and  risk  assessment  experts.  Professional 
judgment  involves  the  integration  of  technical  expertise  in  ecological  sciences 
and  is  important  particularly  in  problem  formulation.  It  must  be  objective,  and 
involves  developing  iterative  processes  for  assessment  and  analysis,  evaluating 
sources  of  uncertainty,  and  interpreting  and  communicating  the  ecological 
significance  of  predicted  or  observed  effects  to  decision-makers. 

•  Ecological  risk  is  expressed  in  a  variety  of  ways,  ranging  from  probabilistic  to 
qualitative  comparison  of  effects  and  exposure. 

Risk  management  is  a  distinctly  different  process  from  risk  assessment.  The  risk  assess¬ 
ment  establishes  that  a  risk  is  present  and  defines  a  range  or  magnitude  for  the  risk.  The 
risk  assessors  can  recommend  a  target  cleanup  goal  to  the  risk  manager  that  will  achieve 
the  goal  of  protection  of  the  assessment  endpoint,  and  also  express  the  risk  associated 
with  the  remedial  options  and  the  resulting  net  risk  reduction  to  the  assessment  end¬ 
points  for  the  remedial  options. 

7.5.1  Preliminary  screening  assessment 

The  risk  manager  and  assessor  need  to  agree  on  a  preliminary  list  of  assessment  end¬ 
points.  The  risk  manager  uses  the  results  of  the  risk  assessor  s  preliminary  screening  to 
decide  if  little  or  no  risk  exists  or  if  ERA  should  be  continued  in  order  to  develop  a  site- 
specific  cleanup  goal  or  to  reduce  the  uncertainty  in  the  evaluation  of  risk. 

7.5.2  Iterative  assessment 

Subsequent  iterations  of  the  risk  assessment  involve  a  more  complex  problem  formula¬ 
tion  stage  and  may  require  refinement  of  assessment  endpoints.  The  risk  assessors  and 
technical  consulting  or  expert  teams  decide  on  the  need  for  iteration  for  data  acquisition/ 
verification  or  possible  modification  of  assessment  and  measurement  endpoints. 

The  risk  assessor  and  risk  manager  should  coordinate  with  appropriate  professionals  in 
making  decisions  related  to  assumptions,  models,  etc.  Risk  assessors  and  the  technical 
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team  ideally  should  work  together  in  developing  recommendations  for  specific  target 
cleanup  goals. 

The  risk  assessors  characterizing  risk  using  weight  of  evidence  can  recommend  a  target 
cleanup  goal  to  the  risk  manager.  This  is  where  risk  management  is  a  distinctly  different 
process  from  risk  assessment.  The  risk  manager  is  the  one  who,  through  evaluation  of  re¬ 
medial  options,  target  cleanup  goals,  and  the  magnitude  of  the  risk,  as  well  as  manage¬ 
ment/regulatory  issues,  makes  the  decision  on  the  cleanup  levels  and  actions. 

In  evaluating  remedial  options,  risk  managers  will  require  additional  professional  assis¬ 
tance  to  interpret  the  implications  of  baseline  ERA  and  remedial  options.  Because  of  the 
nature  of  SERA  and  remedial  options  available,  it  will  be  implicit  as  part  of  decision 
making  that  risk  assessors  will  evaluate  the  risk  resulting  from  disturbances  of  sediment 
habitat  and  also  recovery  rates  under  different  scenarios. 

The  potential  for  adverse  effects  varies  with  each  remedial  alternative.  For  each  option, 
ERA  can  be  used  to  evaluate  risk  during  and  after  remediation  for  both  action-specific 
risk  (e.g.,  likelihood  of  physical  destruction  of  habitat  during  dredging)  and  chemical- 
specific  risk.  Adverse  effects  of  remedial  actions  that  involve  excavation  (e.g.,  in  wetlands), 
dredging,  redisposal,  or  containment  may  include  the  following: 

increased  bioavailability  of  toxic  contaminants  during  the  remedial  action;  may 
result  in  pulse  releases  to  vicinity  and  downstream  areas) 

•  Physical  removal  of  vegetation  and  animals 

•  Increased  toxicity  by  transformation  of  chemicals  or  release  from  sediment 
during  treatment 

•  Disturbance  during  human  activity  at  a  site,  such  as  annoying  noise  to  marine 
mammals,  spawning  harassment,  alienation  of  some  organisms 

The  comparative  risk  process  evaluating  residual  chemical  risk  versus  action-specific  risk 
(necessarily  predictive)  requires  information  concerning  rates  of  community  recovery. 
Decisions  may  result  in  a  less  protective  cleanup  level  resulting  in  short-term  risk  but 
resulting  in  eventual  reduced  risk  through  natural  recovery. 

7.6  Post-remediation  monitoring 

An  additional  iteration  of  any  SERA  is  a  long-term  review  of  remediation  success  or  of 
natural  recovery.  Also,  if  there  will  be  residual  risk  or  contaminants  left  in  place  post-re¬ 
mediation,  a  monitoring  plan  may  be  required.  The  magnitude  and  complexity  of  such 
monitoring  are  dependent  on  the  remedy,  residual  risk,  and  contaminant  class,  as  well  as 
other  factors. 

Post-remediation  monitoring  is  the  final  iteration  of  the  SERA.  Its  purpose  is  to  validate 
the  results  of  preceding  tiers  in  the  ERA  and  either  to  evaluate  the  effectiveness  of  the  re¬ 
mediation  selected  based  upon  the  remediation  goals  or  to  confirm  that  acceptable  deci- 


Physical  (local)  destruction  of  habitat  and  dominant  aquatic  species  (may  also 
involve  destruction  of  sediment  and  redox  strata  resulting  in  potentially 
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sions  were  made.  Evaluations  are  typically  based  on  one  or  more  of  the  assessment  end¬ 
points  selected  in  the  SERA. 

The  tools  used  to  make  the  evaluation  are  typically  the  same  tools  as  those  used  in  the 
SERA,  unless  new  techniques  become  available  for  the  same  measurement  endpoint. 
However,  if  the  same  tools  are  used,  they  may  be  more  specific  to  the  measurement  end¬ 
point,  more  easily  obtained,  or  more  cost  effective. 

Sampling  stations  or  transects  can  be  targeted  specifically  based  on  extent  of  remedial 
action  and  spatial  extent  of  adverse  effects  or  areas  of  concern.  The  monitoring  time 
frame  and  sampling  episodes  should  be  based  upon  the  contaminants  remaining  in  the 
system  and  their  characteristics,  the  magnitude  of  the  contamination  remaining,  other 
point  sources  (for  recontamination),  the  anticipated  time  of  recovery  for  the  system,  and 
data  needs  for  the  evaluation. 

Ideally,  data  needs  for  post-remediation  monitoring  should  be  evaluated  during  the 
SERA,  resulting  in  triggers  for  time-dependent  sampling  events  to  evaluate  trends.  The 
intensity  of  post-remediation  monitoring  should  decrease  with  time,  reflecting  the  con¬ 
fidence  of  the  risk  assessor  in  the  ERA  predictions.  Accordingly,  some  mechanism  or  trig¬ 
ger  for  sunsetting  such  monitoring  is  also  required. 

7.7  Research  needs 

The  ERA  process  for  a  contaminated  sediment  site  encompasses  many  sediment-specific 
issues  due  to  the  complex  nature  of  the  sediment  environment.  Due  to  this  complexity, 
there  are  some  limitations  to  SERA  which  can  only  be  addressed  by  further  research. 
Research  areas  and  needs  specific  to  contaminated  sites  are  outlined  below,  categorized 
according  to  function  in  the  SERA  process:  1)  exposure  and  effects  analysis,  2)  risk  char¬ 
acterization  and  assessment,  3)  cleanup  and  monitoring  of  the  contaminated  site,  and  4) 
issues  specific  to  wetlands. 

7.7.1  Exposure  and  effects  analysis 

Exposure  and  effects  analysis  are  the  two  processes  which  must  be  combined  in  order  to 
develop  any  risk  assessment.  For  contaminated  sediment  sites,  the  following  research 
areas  and  needs  exist: 

•  Improved  dose-response  models  are  needed  to  effectively  link  exposure  mea¬ 
surements  to  effects.  For  example,  for  measurement  endpoints  such  as  tissue 
concentrations  to  be  useful,  a  mechanistic  relationship  with  an  adverse  effect 
must  be  established  (Le.,  What  impact  does  a  concentration  of  a  contaminant  in 
an  organism  have  on  that  organism,  particularly  for  congener  PCBs,  mercury, 
PCDDs,  and  PCDFs?).  More  information  is  needed  for  different  species  on 
levels  of  persistent  (bioaccumulative/bioconcentrating)  compounds  in  tissue 
which  cause  adverse  effects  (e.g.,  decrease  in  egg  production,  reduced  viability, 
reproductive  impairment).  This  is  important  for  sediment  risk  assessment 
because  of  the  variety  of  exposure  pathways  resulting  from  contaminated 
sediments. 
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•  There  is  a  need  to  develop  new  toxicity  tests  which  focus  on  chronic  endpoints 
{i.e.,  growth  and  reproduction  as  well  as  survival),  which  can  be  related  to 
population-level  endpoints.  There  is  also  a  need  to  refine  existing  tests  so  that 
factors  other  than  contaminants  can  be  evaluated  {e.g.,  habitat  including  grain 
size  and  TOC,  feeding  rates,  water  quality). 

•  A  better  understanding  is  needed  of  bioavailability  in  sediments  that  results  in 
exposure.  Specifically,  are  there  other  models/factors  in  addition  to  EqP  and 
AVS  (USEPA 1993, 1994b)  which  may  be  able  to  more  accurately  predict 
exposure  and  any  resulting  toxicity  from  contaminated  sediments? 

•  The  environmental  fate  and  toxicity  of  various  compounds  should  be  investi¬ 
gated.  Are  they  bioavailable?  What  concentrations  in  sediment  are  protective  of 
species  and  habitats?  A  classification  system  for  the  fate  and  effects  of  com¬ 
pound  classes  encountered  in  SERA  is  needed. 

•  Further  investigation  of  the  factors  affecting  the  bioavailability  of  contaminants 
often  found  at  waste  sites  {e.g.,  metals,  PCBs,  and  PAHs)  and  of  useful  indicator 
organisms  is  needed.  In  particular,  what  environmental  and  biological  factors 
increase/decrease  the  bioavailability  of  metals  in  wetland  (freshwater,  estuarine, 
and  marine)  sediments? 

•  There  is  a  need  to  develop  and  evaluate  environmental  investigation  strategies 
and  bioassessment  techniques  in  order  to  learn  more  about  the  sensitivities  of 
various  bioassays/organisms  {i.e.,  validation  studies).  This  will  allow  the  risk 
assessor  to  make  sound  recommendations  for  using  one  test  over  another. 

•  Methods  of  sediment  toxicity  identification  evaluation  (TIE)  should  be  im¬ 
proved  so  that  a  wider  range  of  organisms  can  be  used  and  detection  of  poten¬ 
tial  causative  agents  improved. 

•  Existing  bioassessment  tools  that  are  not  routinely  used  at  sites  with  contami¬ 
nated  sediments  should  be  applied  to  test/ demonstrate  their  utility  {e.g. , 
subcellular  biomarkers).  Biomarkers  may  be  a  relevant  endpoint,  but  a  cause 
and  effect  relationship  must  first  be  demonstrated. 

•  A  better  understanding  is  required  of  biological  interactions  to  determine  which 
indirect  effects  are  most  likely  to  translate  to  direct  effects.  This  may  include  the 
development  of  population  models  to  synthesize  exposure  and  effects  for 
multiple  species. 

•  A  better  understanding  is  required  of  benthic  community  structure  and 
function.  It  is  particularly  important  to  determine  and  define  what  should  be 
present  in  which  habitats  and  what  is  functionally  necessary  in  those  habitats.  A 
general  improvement  in  evaluations  is  required  for  systems  other  than  small/ 
medium  pool-run  streams,  in  particular  for  big  river  systems. 

•  There  is  a  need  for  agreement/standardization  of  the  appropriate  level  of 
statistical  significance  for  field  measurement  endpoints. 

•  There  is  a  need  for  quality  assurance/quality  control  (QA/ QC)  guidance  for 
field  collections  of  biological  samples. 
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7.7.2  Risk  characterization  and  assessment 

Risk  characterization  and  risk  assessment  synthesize  the  exposure  and  effects  results. 

Areas  and  needs  for  future  research  include  these: 

•  Developing  a  system  for  tracking  what  has  been  done  and  what  is  being  done  at 
contaminated  sites  with  regard  to  the  sediment  risk  assessment  process  (lessons 
learned).  For  example,  cases  studies  including  data  from  RI/FS  unpublished 
literature,  negative  data,  and  other  examples  could  be  summarized  and  made 
available  to  others.  A  database  of  completed  risk  assessments  could  be  one  way 
of  summarizing  much  of  this  information. 

•  Conducting  sediment-based  synoptic  chemistry  and  bioassessment  studies  with 
a  variety  of  endpoints  {e.g.,  the  triad.  Chapman  1990, 1996)  to  provide  more 
data  for  field  verification  of  benchmarks.  For  example,  laboratory  testing 
indicates  toxicity,  yet  a  field  evaluation  of  the  benthic  community  indicates  no 
adverse  effects.  How  are  these  two  experimental  results  reconciled  in  the 
context  of  an  SERA? 

•  Critically  evaluating  sediment  benchmark  values  —  e,g,  effects  range  low  (ERLs) 
(Long  et  al.  1995)  and  SQC  —  relative  to  the  technical  basis  of  their  derivation 

•  Summarizing  commonly  used  toxicological  benchmarks  for  risk  assessments. 

For  example,  20%  eggshell  thinning  (for  birds)  is  typically  taken  as  an  indication 
of  an  adverse  effect.  These  benchmarks  should  be  summarized  and  standard¬ 
ized  if  possible. 

•  Determining  the  relevance  of  behavioral  responses.  Can  the  cause-and-effect 
connection  due  to  the  presence  of  a  contaminant  be  demonstrated? 

•  Determining  the  best  way  to  deal  with  mixtures  of  contaminants  in  determining 
risk.  For  instance,  is  the  additivity  approach  adequate  and  appropriate  for  joint 
action  toxicity? 

•  Developing  watershed  approaches  for  single  watersheds  affected  by  more  than 
one  contaminated  site  and  for  multiple  watersheds  affected  by  one  large  single 
site 

•  Developing  a  landscape  approach  for  measurement  endpoints  and  study 
design,  and  for  conducting  the  risk  assessment.  This  approach  integrates  a 
spatial  component  as  part  of  the  risk  assessment  process. 

•  Determining  whether  volatile  organic  compounds  (VOCs)  that  pass  through 
sediment  from  groundwater  discharges  are  a  threat  to  benthic  communities 

7.7.3  Cleanup  and  monitoring 

Even  if  a  substantive  risk  is  determined  to  exist  at  a  contaminated  sediment  site,  it  may 
be  better  to  leave  the  contamination  undisturbed  if  the  cleanup  or  remediation  will  result 
in  greater  ecological  risk.  In  this  case  or  if  a  site  is  cleaned  up,  a  monitoring  program  may 
be  required  to  determine  the  effectiveness  of  any  cleanup.  Several  research  areas  and 
needs  exist  in  this  regard: 
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•  Investigating  the  time  required  for  the  return  of  habitat  function  following  a 
hazardous  materials  release  through  monitoring  at  sites  after  cleanup  and 
mitigation  is  completed 

•  Determining  what  parameters  should  be  monitored  and  how  a  monitoring 
program  should  be  designed  to  determine  the  success  of  remediation 

•  Quantifying  the  risk  of  remediation.  For  instance,  remediation  of  contaminated 
sediments  may  result  in  resuspension  of  contaminants.  What  tests  and  models 
can  be  used  to  predict  and  evaluate  this  risk? 

•  Evaluating  the  effectiveness  of  remediation  technology,  in  particular  assessing 
the  use  of  capping  as  a  remediation  technology  for  contaminated  sediments. 
What  is  the  effectiveness  of  capping  for  different  sediment  contaminants  in 
different  environments  and  what  is  the  best  cap  design?  Does  groundwater  flow 
alter  the  effectiveness  of  the  cap?  What  is  the  actual  lifetime  of  a  cap? 

7.7.4  Wetland  issues 

Wetland  environments  are  extremely  complex  ecological  systems  that  may  contain  con¬ 
taminated  sediments.  Because  of  this  complexity,  several  specific  research  areas  and 
needs  have  been  identified: 

•  Deriving  a  methodology  for  determining  when  to  clean  up  contaminated 
sediments  in  a  generic  wetland  and  when  to  leave  them  in  place.  Studies  are 
needed  on  the  effects  of  leaving  a  wetland  contaminated  versus  the  effects  of 
cleaning  it  up.  What  is  the  time  frame  of  recovery  of  wetland  species  from  both 
food  chain  contamination  and  physical  disturbance  of  cleanup?  A  cost-benefit 
type  of  analysis  may  be  appropriate. 

•  Researching  methods  of  enhancing  and  restoring  specific  contaminated 
wetlands  that  minimize  disturbance.  What  type  of  wetlands  have  a  greater 
success  with  restoration  in  different  regions  of  the  country?  What  is  the  risk  of 
remediation  when  using  different  remediation  technologies  {e.g.,  bioremedia¬ 
tion  or  in-place  remediation)? 

•  Determining  the  importance  of  contaminants  (c.^.,  mercury,  PCBs)  in  wetland 
sediments  as  sources  of  contamination  for  habitats  and  biota  outside  of  wetland 
areas 

Specific  application  of  ERA  to  wetlands  has  been  addressed  in  a  recent  SETAC  workshop 
(Lewis  et  al ,  in  press) . 

7.5  Management  issues 

7.5.1  Communication 

There  is  a  need  for  increased  communication  throughout  the  SERA  process  to  assist  in 
site  management.  Open  discussion  of  issues  which  lead  to  the  selection  of  initial  assess¬ 
ment  endpoints,  and  subsequently  to  clearly  defined  final  assessment  endpoints,  should 
resolve  misunderstandings  and  alternate  interpretations  as  to  what  is  being  evaluated 
and  why.  Communication  resulting  in  total  transparency  (to  the  extent  legally  possible) 
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should  defuse  much  of  the  apparent  conflict  which  currently  exists  related  to  contami¬ 
nated  sites’  SERA, 

At  present,  site  management  suffers  less  from  a  lack  of  tools  than  from  a  lack  of  applica¬ 
tion  of  a  clearly  defined  and  stated  process.  This  results  in  inadequate  communication 
between  not  only  the  risk  manager  and  the  risk  assessors  but  also  between  the  other  par¬ 
ties  involved  with  a  site.  For  example,  if  assessment  endpoints  are  not  clearly  defined  and 
stated,  it  is  likely  that  one  if  not  all  of  the  parties  involved  will  view  the  assessment  point 
differently.  When  this  occurs,  conflicts  will  arise  at  the  decision  stage. 

For  all  parties  involved,  the  stakes  are  high  at  contaminated  sediment  sites.  Various  agen¬ 
cies  have  legal  mandates  related  to  sediment  management.  Remediation  of  sediment  con¬ 
tamination  includes  financial  interests,  and  the  general  public  have  their  own  interests. 
Because  of  these  issues,  over-  or  underestimating  the  ecological  risk  posed  by  a  contami¬ 
nated  sediment  can  have  far-reaching  ramifications.  The  balancing  or  lack  of  balancing 
of  these  positions  can  translate  into  delays  in  the  site  management  schedule  and  into 
overall  increased  cost  of  assessment  in  addition  to  the  conflicts  which  result.  Increased 
costs  can  result  from  redundant  sampling  efforts,  increased  management/oversight 
costs,  multiple  efforts  to  analyze  data  and  characterize  the  risks,  and  in  the  worst  cases, 
incorrect  decisions  for  managing  the  sediment  risks. 

In  addition  to  increased  risk  assessment  communication,  all  parties  must  understand  all 
responsibilities  (mandates)  and  limitations.  For  example,  there  may  be  impediments  to 
inclusion  of  all  potential  sources  of  ecological  risk  from  contaminated  sediments  based 
upon  what  the  statutes  regulate.  For  example,  USEPA  Superfund  does  not  regulate  oil  and 
related  materials,  as  well  as  a  variety  of  other  materials  which  are  not  specified  as  “haz¬ 
ardous  substances,”  nor  does  it  regulate  physical  disturbances. 

Through  this  document  and  other  efforts,  agreement  must  be  reached  on  what  is  to  be 
accomplished  (and  what  can  be  accomplished)  through  the  management  of  contami¬ 
nated  sediments.  In  particular,  there  must  be  agreement  on  what  level  of  residual  ecologi¬ 
cal  risk  is  acceptable  and  what  sediment-associated  risk  is  ecologically  relevant. 

The  potentially  confrontational  nature  of  the  risk  management  decisions  should  be  di¬ 
minished  by  these  actions: 

•  Ensuring  a  defined  framework  for  the  development  of  the  conceptual  model  and 
selection  of  the  assessment  and  measurement  endpoints  such  that  there  is  a 
clear,  scientifically  defensible  justification  for  the  data  which  need  to  be  col¬ 
lected 

•  Developing  how  the  exposure  and  effects  will  be  evaluated  and  how  a  scientifi¬ 
cally  defensible  study  will  be  designed  to  answer  the  questions  identified 
through  problem  formulation,  such  that  a  level  playing  field  is  established  for 
the  characterization  of  the  sediment  risk 

•  Developing  monitoring  plans  for  validation  of  the  effectiveness  of  the  actions 
taken  for  the  particular  site,  which  are  founded  in  the  risk  assessment  process 
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and  only  implemented  with  a  clear  statement  as  to  how  the  data  will  be  inter¬ 
preted 

7.5.2  Residual  risk 

The  remediation  of  contaminated  sediment  sites  is  unlikely  to  remove  all  ecological  risks, 
unless  the  remedy  is  to  remove  all  of  the  contamination.  This  option  is  generally  techni¬ 
cally  and  financially  unrealistic  and  may  not  be  scientifically  justifiable.  It  follows  that 
residual  ecological  risk  will  remain  after  the  contaminated  site  is  remediated.  Currently, 
this  is  an  uncomfortable  position  for  regulators,  the  regulated  party,  as  well  as  those  with 
other  interests  in  the  assessment  endpoints.  Regulations,  including  those  not  directly 
applicable  to  the  management  of  contaminated  sites,  must  be  considered  and  an  agree¬ 
ment  reached  as  to  how  these  post-remediation  issues  can  be  addressed.  For  example, 
natural  resources  damage  assessments  (NRDAs)  are  significant  considerations  in  post¬ 
remediation  liability  to  financially  accountable  parties.  Statements  of  the  residual  risk 
currently  have  important  implications  to  both  the  natural  resource  trustees  and  the  fi¬ 
nancially  accountable  parties. 

7.6  Recommendations 

•  The  current  risk  paradigm  is  appropriate  for  ERA  at  contaminated  sediment 
sites  (Chapter  1). 

•  A  tiered,  iterative  approach  to  sediment  risk  assessment  should  be  used.  This 
will  maximize  efficiency  of  effort  and  resources  in  reaching  decisions.  At  the  end 
of  each  tier,  the  information  generated  should  be  critically  evaluated  to  deter¬ 
mine  if  an  additional  tier  is  needed,  if  a  decision  can  be  made,  or  if  additional 
information  is  needed  to  reach  a  decision. 

•  Communication  between  risk  assessor,  risk  manager,  and  all  involved  parties  is 
essential  at  all  stages  of  SERA.  In  particular,  all  parties  should  agree  to  the 
assessment  and  measurement  endpoints  which  will  drive  the  risk  assessment, 
understand  the  basis  of  the  risk  assessment  process,  and  understand  the  need 
for  flexibility  in  accommodating  unique  site  characteristics. 

•  A  weight-of-evidence  approach  using  best  professional  judgment  to  the  inter¬ 
pretation  of  data  should  be  used  in  conducting  an  SERA. 

•  The  ecological  risk  associated  with  contaminated  sites  should  not  be  evaluated 
out  of  context  of  the  environmental  system  of  which  it  is  part,  including  the 
cumulative  risk  at  the  watershed  or  landscape  level. 

•  Ecological  risk  assessment  of  contaminated  sediments  needs  to  consider  both 
spatial  and  temporal  components. 

•  Assessment  endpoints  need  to  be  associated  with  measurement  endpoints 
through  the  route  of  exposure  and  the  mechanism  of  effect.  The  measurement 
endpoints  must  be  technically  sound  and  defensible,  and  whenever  possible,  at 
least  one  measurement  endpoint  should  be  field-based. 
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•  While  resources,  legal  issues,  and  site  management  schedules  are  important, 
they  are  not  justification  for  poor  science. 

•  The  initial  risk  assessment  will  characterize  the  existing  risk  at  the  site  in  a 
retrospective  fashion.  In  considering  remedial  options  and  reaching  final 
decisions,  a  prospective  risk  assessment  will  be  required  to  evaluate  the  efficacy 
of  the  remedial  options  in  achieving  risk  reduction  goals. 
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8.1  Introduction 

Due  to  the  wide  range  of  questions  that  may  arise  as  part  of  a  sediment  assessment,  meth¬ 
ods  that  have  been  used  in  the  laboratory  and  field  to  evaluate  potential  biological  effects 
of  sediment-associated  contaminants  are  as  diverse  as  those  for  testing  any  type  of  envi¬ 
ronmental  media.  A  complete  summary  of  these  methods  is  beyond  the  scope  of  this 
short  paper;  however,  comprehensive  reviews  on  the  subject  are  available  elsewhere 
(Giesy  and  Hoke  1989;  Burton  1991;  Ingersoll  1995).  The  purpose  of  this  essay  is  to  iden¬ 
tify  measurement  endpoints  commonly  used  for  evaluating  contaminated  sediments,  in 
particular  from  a  laboratory  versus  field  perspective.  My  focus  will  be  upon  effects  related 
to  toxicity  of  sediment  contaminants  as  opposed,  for  example,  to  physical  disturbance  of 
sediments  and  associated  benthic  habitats. 

8.2  Measurement  endpoint  used  in  sediment  assessments 

From  a  generic  point  of  view,  the  major  assessment  endpoint  with  respect  to  the  direct 
toxicity  {i.e.,  effects  not  manifested  at  higher  trophic  levels  through  bioaccumulation)  of 
sediment-associated  contaminants  is  related  to  the  presence  of  benthic  communities  that 
exhibit  functional  and  structural  characteristics  that  would  exist  in  the  absence  of  im¬ 
pacts  associated  with  sediment  contaminants.  From  a  practical  standpoint,  assessment 
endpoints  usually  are  expressed  in  terms  of  a  “use”  statement,  e.g.,  the  desire  to  support 
a  sustainable  fishery  (and  hence,  a  viable  benthic  food  web).  Unfortunately,  the  linkage 
between  assessment  and  measurement  endpoints  is  at  best  uncertain.  Part  of  this  is  re¬ 
lated  to  the  fact  that  with  few  exceptions,  natural  structure  or  function  is  unknowable, 
and  thus  typically  is  approximated  through  the  use  of  comparative  toxicological/ecologi¬ 
cal  analysis  of  reference  sites.  Toxicological  analyses  usually  are  conducted  using  labora¬ 
tory-based  measurement  endpoints,  while  an  ecological  perspective  typically  is  gained 
through  surveys/summaries  of  field-based  endpoints  describing  benthic  community 
structure.  Irrespective  of  whether  measurement  endpoints  are  derived  from  laboratory  or 
field  data,  because  of  the  relatively  undefined  nature  of  what  exactly  constitutes  an  ac¬ 
ceptable  benthic  community,  endpoints  used  under  different  scenarios  can  differ  quite 
markedly. 

Although  field-based  measurement  endpoints,  such  as  organism  abundance  and  commu¬ 
nity  diversity,  may  be  easier  to  relate  to  assessment  endpoints  than  results  obtained  from 
laboratory  tests,  there  are  a  number  of  reasons  why  laboratory  assays  are  more  com¬ 
monly  used  in  assessments  of  sediment  toxicity  than  field-based  analyses.  From  a  practi- 
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cal/logistical  standpoint,  laboratory  assays  typically  are  less  expensive  and  more  rapid 
than  surveys  of  benthic  community  structure.  This  enables,  for  example,  more  intensive 
sampling  of  sites,  particularly  at  screening  stages  of  an  assessment.  In  addition,  less  ex¬ 
pertise  is  generally  required  for  interpretation  of  the  measurement  endpoints  (e.g.,  de¬ 
creased  survival,  growth,  or  reproduction)  typically  used  in  laboratory  tests  as  opposed, 
for  example,  to  evaluating  differences  in  diversity  or  similarity  indices  associated  with 
extant  benthic  communities.  This  facilitates  more  ready  application  of  results  to  regula¬ 
tory/management  scenarios  in  which  the  decision-maker  may  not  be  a  trained  biologist. 
In  this  context,  measurement  endpoints  based  on  laboratory  assays  also  can  be  very  use¬ 
ful  as  a  “real-time”  tool  for  managers  monitoring  the  effectiveness  of  remedial  activities. 

Another  factor  that  contributes  to  the  extensive  use  of  laboratory  assays  for  sediment 
assessments  is  related  to  the  general  concept  of  cause  and  effect.  The  structure  and  func¬ 
tion  of  benthic  communities  is  not  necessarily  dictated  only  by  sediment  contaminants; 
differences  in  species  abundance  or  composition  also  may  be  influenced  by  factors  such 
as  transient  toxicity  events  associated  with  water  overlying  the  sediment,  physical  distur¬ 
bance,  lack  of  suitable  habitat  (e.g.,  substrate,  dissolved  oxygen),  resource  limitations, 
and/ or  biotic  interactions  (e.g.,  competition,  predation).  On  the  other  hand,  if  a  sediment 
exerts  overt  toxicity  to  one  or  more  sensitive  sentinel  species  in  a  laboratory  setting,  it  is 
reasonable  (in  the  absence  of  sampling  artifacts)  to  believe  that  this  adverse  effect  would 
be  manifested  in  some  manner  in  the  field.  Assessment  of  sediments  in  the  laboratory 
also  facilitates  the  development  of  dose-response  relationships.  The  greater  degree  of 
control  present  in  a  laboratory  setting  affords  the  potential  for  more  accurate  exposure 
assessment  than  can  be  derived  solely  from  a  field  analysis;  this  is  particularly  true  for 
most  contaminated  sediments  where  one  usually  is  dealing  with  complex  mixtures  of 
contaminants. 

This  is  not  to  imply  that  field  observations  are  not  critical  in  assessing  contaminated  sedi¬ 
ments.  The  incorporation  of  both  field  and  laboratory  data  into  ERAs  of  contaminated 
sediments  is  essential  because  the  current  state-of-science  does  not  support  exclusive 
reliance  upon  either  type  of  measurement  endpoint.  As  a  simple  example,  laboratory 
assays  may  be  used  to  screen  for  the  presence/absence  of  toxicity,  while  subsequent  field 
surveys  could  be  performed  to  explore  how  the  toxicity  is  manifested  (e.g,  through  ab¬ 
sence  of  key  species,  reduced  biomass).  Without  the  field  component,  evaluation  of  the 
relevance  of  measurement  endpoints  as  indicators  of  assessment  endpoints  is  far  more 
uncertain.  However,  to  use  combinations  of  laboratory  and  field  data  for  probabilistic 
risk  assessments,  it  is  necessary  to  develop  quantitative  relationships  between  the  two 
types  of  endpoints.  Below  are  described  some  examples  of  how  laboratory  and  field  data 
concerning  the  biological  effects  of  contaminated  sediments  have  been  related  to  one 
another;  also  briefly  discussed  are  technical  approaches  relative  to  exposure  assessment 
for  complex  mixtures.  From  the  standpoint  of  laboratory  assays,  the  focus  is  on  those  that 
are  widely  accepted  and  have  had  some  degree  of  standardization,  under  the  assumption 
that  for  the  purposes  of  ERA  some  degree  of  standard  “currency”  is  desirable.  However, 
by  focusing  on  these  standard  methods,  it  is  not  my  intention  to  imply  that  other  types 
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of  species/endpoints  might  not  be  useful  for  sediment  assessments;  this  is  a  decision  that 
should  be  made  on  a  situation-specific  basis. 

8.3  Laboratory-to-field  extrapolation 

Organisms  selected  for  use  in  assessing  contaminated  sediments  in  the  laboratory  have 
included  bacteria,  plants,  clams,  annelids,  insects,  micro-  and  macro-crustaceans,  and 
fishes  (Giesy  and  Hoke  1989;  Burton  1991;  Ingersoll  1995).  Assays  employing  these  or¬ 
ganisms  have  been  conducted  with  test  fractions  ranging  from  solid  phase  (whole)  sedi¬ 
ments  to  aqueous  media  such  as  pore  water  and  elutriates  to  solvent  extracts  of 
sediments.  However,  the  most  commonly  utilized  laboratory  assays  for  either  freshwater 
or  marine  sediments  consist  of  exposure  of  benthic  macroinvertebrates  to  whole  sedi¬ 
ments,  For  marine  sediments,  standard  methods  have  been  developed  for  testing  the  am- 
phipods  Rhepoxynius  abronius,  Ampelisca  ahdita,  Leptocheirus  plumulosus,  and  Eohaustorius 
estuarius,  while  for  freshwater  sediments,  standard  assays  exist  for  the  amphipod  Hyaklla 
azteca  and  larvae  of  the  midge  Chironomus  tentans  (USEPA  1994a,  1994b).  Tests  with 
these  species  typically  are  10  days  in  length,  with  the  primary  endpoint  consisting  of  sur¬ 
vival  or  survival  and  growth  {e.g.,  C.  tentans).  These  particular  species  are  used  for  a  num¬ 
ber  of  reasons,  including  a)  general  technical  acceptance  by  the  scientific  community,  b) 
ease  of  culturing  and/or  handling  in  a  laboratory  setting,  c)  degree  of  contact  with  sedi¬ 
ments,  d)  availability  of  data  concerning  relative  sensitivity  to  contaminants,  e)  existing 
evaluations  of  within-  and  among-laboratory  test  variability,  and  f )  evidence  that  assay 
results  with  the  different  species  are  predictive  of  impacts  in  the  field. 

One  factor  that  limits  the  predictive  power  of  current  laboratory  tests  is  that  they  are 
relatively  short-term  assessments  of  field  situations  in  which  chronic  toxicity  may  be  a 
dominant  process.  In  recognition  of  this  limitation,  research  to  extend  test  duration  and 
incorporate  sublethal  endpoints  is  ongoing  with  several  species  that  are  used  in  the  10-d 
tests  {L  plumulosus,  H.  azteca,  C  tentans).  In  any  case,  the  discussion  below  is  germane  to 
the  use  of  either  short-  or  long-term  laboratory  assays. 

The  major  conceptual  problem  in  extrapolating  laboratory  measurement  endpoints  to 
population-level,  and  especially  community-level  measurement  (or  assessment)  end¬ 
points,  is  the  lack  of  a  mechanistic  understanding  of  the  relationship  between  the  two 
different  types  of  endpoints.  That  is,  the  laboratory  test  species  serve  only  as  sentinels  to 
detect  toxicity,  which  as  a  generic  measure  may  not  provide  a  basis  for  quantitative  pre¬ 
diction  of  effects  in  variable  field  settings.  This  is  due  primarily  to  a  lack  of  understand¬ 
ing  of  key  processes  controlling  population  and  community  dynamics,  relative  to  the 
response  of  individual  organisms.  To  address  this  problem,  scientists  involved  in  sedi¬ 
ment  assessments  have  sought  to  develop  empirical  relationships  between  the  results  of 
laboratory  assays  and  field  observations.  As  described  below,  this  certainly  is  a  viable 
approach,  however,  it  must  be  recognized  that  definition  of  these  types  of  relationships 
is  by  necessity  often  somewhat  site-specific,  i.e.,  dependent  both  on  the  suite  of  contami¬ 
nants  present  and  the  existing  (or  desired)  benthic  community. 
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There  are  at  least  two  levels  at  which  one  may  wish  to  extrapolate  results  of  tests  with 
laboratory  organisms  to  population/community-level  endpoints  in  the  field.  The  first 
scenario  is  that  in  which  responses  of  laboratory  organisms  are  used  to  predict  potential 
effects  on  populations  of  the  same,  or  similar,  species  in  the  field.  In  these  instances, 
simple  demographic  models  and/or  empirically  derived  relationships  can  be  used  to 
predict  the  effects  of  decreased  survival  and/ or  reproduction  (in  the  laboratory)  on  popu¬ 
lation  trends  in  the  field,  and  confirm  these  predictions  with  limited  field  analyses.  As 
one  example  of  this,  Wentsel  etal  (Wentsel,  McIntosh,  Anderson  1977;  Wentsel,  McIn¬ 
tosh,  Atchison  1977;  Wentsel  et  al  1978)  correlated  the  presence/absence  of  C  tentans  in 
metal-contaminated  lake  sediments  with  decreases  in  growth/emergence  (and,  hence,  re¬ 
production)  of  the  midge  in  laboratory  tests  with  the  field  sediments.  Another  example 
of  this  type  of  extrapolation  is  provided  by  Giesy  et  al  (1988),  who  reported  that  de¬ 
creases  in  the  growth  of  C  tentans  of  30%  or  more  in  laboratory  tests  with  sediments  from 
the  Detroit  River  corresponded  with  the  absence  of  chironomids  in  the  field.  In  a  study 
from  the  marine  environment,  Swartz  et  fl/.  (1985)  noted  that  the  toxicity  of  sediments 
along  a  contamination  gradient  on  the  Palos  Verdes  Shelf  to  R.  abronius  was  predictive  of 
the  relative  abundance  of  amphipods  in  the  field  (Table  8-1).  Swartz  et  al  (1994)  also  es¬ 
tablished  a  relationship  between  the  relative  abundance  of  amphipods  in  DDT-contami- 
nated  sediments  in  the  field  and  the  results  of  laboratory  toxicity  assays  with  R.  abronius, 
E.  estuarius,  and  H.  azteca. 

The  more  demanding  assessment  scenario  is  that  in  which  results  of  laboratory  assays 
with  one  or  more  of  the  standard  sentinel  species  must  be  extrapolated  to  the  field  to 
predict  population-level  effects  in  a  dissimilar  species,  or  more  difficult  yet,  community- 
level  impacts.  Swartz  etal  (1985)  reported  an  example  of  the  extrapolation  of  results  of 
laboratory  assays  with  R.  abronius  to  population-  and  community-level  impacts  in  con¬ 
taminated  sediments  from  the  Palos  Verdes  Shelf  In  this  case,  the  presence  of  significant 
toxicity  in  the  laboratory  was  correlated  not  only  with  marked  decreases  in  amphipod 
abundance,  but  with  species  richness  and  the  Infaunal  Index  (Table  8-1).  Hoke  et  al 
(1996)  also  provide  an  example  of  a  correlation  between  toxicity  of  sediments  in  the  labo¬ 
ratory  (in  this  case  to  C.  tentans  and  H.  azteca)  and  community-level  effects  on  benthic 
invertebrates  at  a  site  contaminated  by  DDT,  DDD,  and  DDE.  In  these  studies,  increas¬ 
ing  concentrations  of  total  DDTR  (DDT,  DDD,  and  DDE)  in  sediments  were  related  both 
to  increased  toxicity  in  laboratory  toxicity  tests  and  decreases  in  the  total  number  of  spe¬ 
cies  present  in  the  field  (Figure  8-1).  These  data  also  serve  to  illustrate  the  variability  that 
can  exist  in  relationships  between  laboratory  and  field  measurement  endpoints;  although 
there  is  an  inverse  relationship  between  species  richness  and  toxicity,  there  clearly  are 
instances  where  the  two  measurements  appear  to  be  disconnected.  Unfortunately,  in 
most  situations  it  is  impossible  to  ascertain  whether  a  seeming  lack  of  agreement  be¬ 
tween  measurement  endpoints  is  related  to  a  lack  of  biological  coherence  or  simply  to 
sampling  artifacts. 

These  examples  of  the  extrapolation  of  laboratory  results  to  field  settings  are  not  in¬ 
tended  to  be  exhaustive  but  merely  illustrative  of  how  assay  results  have  been  used  in  this 
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Table  8-1  Toxicity  to  Rhepoxynius  abronius  of  sediments  from  the  Palos  Verdes  Shelf  compared  to 
measurements  of  benthic  community  structure 


Site 

Parameter 

1 

2 

3 

4 

5 

6 

7 

R.abronius  survival  % 

IT 

80^ 

80" 

92 

84 

97 

95 

Species  richness  (S/O.lm^ 

15.8 

18.6 

28.2 

41.2 

96.6 

72.4 

73.4 

Infaunal  index 

4,6 

2.8 

18.4 

37.1 

51.8 

63.0 

80.8 

Amphipod  density 

0 

0 

2.4 

1.6 

46.0 

52.2 

65.4 

(N/O.lm^) 

^Significantly  less  than  control. 
Source:  Swartz  cf  a/.  1985 


context.  It  is  important  to  reiterate  that  predictive  relationships  between  most  commonly 
utilized  laboratory  and  field  measurement  endpoints  (particularly  at  the  community 
level)  remain  somewhat  site-specific  in  nature.  Clearly,  these  types  of  extrapolation  ef¬ 
forts  would  be  enhanced  by  more  sensitive  (chronic)  assays  as  well  as  by  a  quantitative 
understanding  of  how  effects  in  one  or  more  laboratory  species  might  translate  into  im¬ 
pacts  at  the  population/community  level. 

8.4  Cause  and  effect 

One  important  insight  that  greatly  enhances  the  development  of  mechanistic  models  of 
relationships  between  laboratory  and  field  measurement  endpoints  is  the  ability  to  iden¬ 
tify  contaminants  responsible  for  sediment  toxicity.  As  mentioned  above,  an  important 
shortcoming  of  measurement  endpoints  based  solely  on  field  observations  is  the  lack  of 
definition  of  cause-and-effect  relationships.  However,  because  most  contaminated  sedi¬ 
ments  contain  mixtures  of  potentially  toxic  chemicals,  traditional  laboratory  toxicity 
assays,  in  and  of  themselves,  also  suffer  from  this  shortcoming.  But  through  the  applica¬ 
tion  of  bioavailability  predictions,  structure  activity,  and  interactive  toxicity  models,  re¬ 
cent  studies  have  shown  that  it  is  possible  to  identify  those  compounds  responsible  for 
sediment  toxicity,  and  thereby  develop  plausible  dose-response  relationships.  For  ex¬ 
ample,  Swartz  etal  (1994)  recently  described  the  application  of  bioavailability  and  “toxic 
units”  models  to  differentiate  sediment  toxicity  due  to  DDT  from  contributions  of  PAHs, 
dieldrin,  and  metals.  Swartz  et  al  (1995)  also  describe  the  use  of  bioavailability,  quanti¬ 
tative  structure  activity,  and  interactive  toxicity  models  to  make  quantitative  predictions 
of  the  toxicity  of  sediments  contaminated  with  complex  mixtures  of  PAHs. 

Another  approach  to  the  identification  of  contaminants  responsible  for  sediment  toxic¬ 
ity  is  via  the  use  of  toxicity-based  fractionation  schemes,  commonly  termed  TIEs.  Ankley 
and  Schubauer-Berigan  (1995)  provide  an  overview  of  TIE  methods,  and  describe  the  ap¬ 
plication  of  these  procedures  to  contaminated  sediments.  It  also  is  possible  to  use  TIE 
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and  DDE)  to  Chironomus  tentans  and  Hyalella  azteca  versus  number  of  species  in  the  field 
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procedures  in  conjunction  with  toxic  units/mixture  models  and  predictions  of  contami¬ 
nant  bioavailability.  For  example,  Ankley  et  al  (1991)  used  basic  TIE  manipulations  to 
help  explore  the  utility  of  a  proposed  bioavailability/toxicity  model  for  cationic  metals  in 
sediments. 

Although  diagnostic  methods  such  as  TIEs  and  bioavailability/interactive  toxicity  models 
thus  far  have  been  applied  only  infrequently  to  the  interpretation  of  sediment  toxicity,  I 
feel  that  they  are  critical  to  ensuring  the  success  of  ERAs  with  contaminated  sediments. 
The  results  of  these  types  of  analyses  not  only  facilitate  a  mechanistic  understanding  of 
relationships  between  measurement  and  assessment  endpoints,  but  they  help  provide  a 
solid  technical  underpinning  for  the  identification  of  appropriate  remedial  options. 
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Ecological  significance  of  endpoints 
used  to  assess  sediment  quality 

William  H.  Clements 


9.1  Introduction 

Criteria  for  selecting  endpoints  in  ERAs  include  susceptibility  to  the  stressor,  ease  of 
measurement,  unambiguous  definitions,  and  societal  and  ecological  relevance  (Suter 
1993).  To  link  sediment  quality  assessments  with  ERAs,  similar  criteria  should  be  em¬ 
ployed  to  select  endpoints  to  assess  sediment  quality.  Ecological  relevance  of  population, 
community,  and  ecosystem  responses  to  contaminants  has  received  relatively  little  atten¬ 
tion,  although  some  scientists  argue  that  this  is  probably  the  most  important  criterion 
(Cairns  1986).  If  the  ultimate  objective  of  an  ERA  is  to  protect  ecological  integrity,  it  fol¬ 
lows  that  ecological  relevance  of  endpoints  should  be  given  greater  priority. 

One  of  the  challenges  that  we  face  when  conducting  any  ERA  is  distinguishing  ecologi¬ 
cally  important  changes  from  change  per  se.  In  other  words,  simply  because  we  are  able 
to  demonstrate  statistically  significant  differences  between  reference  and  polluted  sites 
does  not  necessarily  mean  that  biological  or  ecological  integrity  {sensu  Karr  and  Dudley 
1981)  is  compromised.  More  importantly,  of  the  myriad  of  endpoints  that  have  been 
shown  to  respond  to  contaminants  in  sediments,  which  ones  indicate  significant  ecologi¬ 
cal  effects?  Although  this  problem  is  most  critical  for  sublethal  effects  measured  at  lower 
levels  of  organization  {e.g.,  physiological  and  biochemical  responses),  ecological  relevance 
of  changes  at  higher  levels  of  organization  (populations,  communities,  and  ecosystems) 
should  also  be  evaluated.  Because  it  is  unlikely  that  any  single  endpoint  will  be  useful  in 
all  situations,  a  suite  of  measures  that  integrate  responses  across  levels  of  organization  is 
necessary  (Karr  1991;  Karr  1993;  Clements  and  Kiffney  1994a).  I  suggest  that  linking 
responses  among  levels  of  organization  and  correlating  sublethal  responses  with  signifi¬ 
cant  ecological  effects  is  an  important  area  of  research  for  sediment  risk  assessments 
(Clements  and  Kiffney  1994b). 

Ecologically  relevant  effects  of  contaminants  on  freshwater  ecosystems  may  be  divided 
into  three  general  categories.  First,  direct  effects  of  contaminants  may  include  shifts  in 
community  composition,  reduced  species  diversity,  or  changes  in  ecosystem  function. 
These  direct  effects  generally  result  from  differences  in  sensitivity  among  taxa.  Secondly, 
indirect  effects  of  contaminants  may  include  alterations  in  predator-prey  or  competitive 
interactions.  For  example,  lower  abundance  of  a  particular  species  may  result  from  re¬ 
duced  abundance  of  its  prey  or  increased  abundance  of  a  competitor.  Finally,  direct  and 
indirect  effects  of  contaminants  on  benthic  communities  may  influence  higher  trophic 
levels  via  loss  of  important  prey  resources  or  food  chain  transfer  of  contaminants.  The 
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primary  objective  of  this  presentation  is  to  discuss  the  ecological  significance  of  end¬ 
points  used  to  evaluate  contaminated  sediments.  1  will  briefly  discuss  the  appropriate¬ 
ness  of  using  benthic  communities  in  sediment  risk  assessments  and  argue  that 
structural  and  functional  endpoints  at  several  levels  of  organization  are  necessary  to 
evaluate  contaminated  sediments.  Using  studies  from  the  primary  literature  and  results 
of  my  research  conducted  at  a  USEPA  Superfund  site,  I  will  describe  the  importance  of 
measuring  direct,  indirect,  and  food  chain  effects  of  sediment  contaminants.  Finally,  I 
will  provide  recommendations  for  future  research  to  evaluate  ecological  significance  of 
endpoints  used  in  sediment  risk  assessments. 

9.2  Focus  on  benthic  communities 

Most  assessments  of  sediment  contamination  have  focused  on  responses  of  benthic 
macroinvertebrates.  The  use  of  benthic  macroinvertebrate  communities  to  assess  effects 
of  contaminants  on  aquatic  ecosystems  has  a  long  history  (Cairns  and  Pratt  1993),  and 
changes  in  composition  of  benthic  communities  have  been  used  as  indicators  of  water 
quality  since  the  early  1900s  (Carpenter  1924;  Richardson  1929).  Because  of  the  attention 
that  benthic  communities  have  received  in  water  quality  assessments,  unambiguous 
operational  definitions  of  community  responses  to  contaminants  have  been  developed 
(Plafkin  et  al.  1989).  Because  of  their  intimate  association  with  sediments,  benthic  com¬ 
munities  are  highly  susceptible  to  sediment  contaminants.  Benthic  macroinvertebrates 
are  important  in  the  function  of  aquatic  ecosystems  and  in  the  diet  of  higher  trophic  lev¬ 
els  {e.g.,  sport  fish).  Therefore,  endpoints  associated  with  benthic  macroinvertebrates 
have  both  ecological  and  societal  relevance.  Because  benthic  macroinvertebrates  readily 
accumulate  sediment  contaminants,  they  represent  an  important  link  to  these  higher 
trophic  levels.  Finally,  assessments  of  benthic  community  structure  provide  data  that 
may  be  used  to  assess  integrated  responses  at  several  levels  of  organization  simulta¬ 
neously.  For  example,  Kerans  and  Karr  (1994)  have  recently  developed  an  index  of  bio¬ 
logical  integrity  for  benthic  communities,  which  integrates  structural  and  functional 
measures  into  a  single  value. 

9.3  Criticisms  of  single-species  toxicity  tests 

Criticisms  of  single-species  laboratory  toxicity  tests  for  predicting  effects  of  contaminants 
on  natural  populations,  communities,  and  ecosystems  are  well  known  (Cairns  1986).  The 
basis  for  most  of  this  criticism  is  the  generally  untested  assumption  that  laboratory  re¬ 
sponses  of  surrogate  species  are  protective  of  ecosystem  structure  and  function.  Valida¬ 
tion  of  single-species  toxicity  tests,  using  more  sophisticated  testing  systems  such  as 
microcosms,  mesocosms,  and  direct  field  experimentation,  have  been  employed  to  test 
this  assumption.  Good  correspondence  between  laboratory  toxicity  tests  and  field  assess¬ 
ments  has  been  reported;  however,  in  many  instances  failure  to  account  for  indirect  ef¬ 
fects  has  resulted  in  underestimates  of  contaminant  effects  (Schindler  1987;  Gonzalez 
and  Frost  1994). 
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Similar  criticisms  of  single-species  toxicity  tests  are  also  applicable  to  sediment  toxicity 
tests,  in  which  growth  rates  or  survivorship  of  surrogate  benthic  species  (e.g.,  Chironomus 
tentans,  Hyalella  azteca)  are  employed  to  predict  effects  of  contaminated  sediments  on 
natural  communities  in  the  field.  Failure  to  account  for  indirect  effects  and  the  lack  of 
research  on  effects  of  sediment  contaminants  on  ecosystem  processes  limits  the  applica¬ 
bility  of  sediment  toxicity  tests.  Development  of  approaches  that  integrate  sediment  tox¬ 
icity  tests  with  field  measurements,  such  as  the  sediment  quality  triad  (Chapman  1989; 
Chapman  et  al  1991)  will  improve  the  predictive  ability  of  simple  laboratory  procedures. 

9.4  Structural  and  functional  measures 

Ecotoxicologists  generally  distinguish  between  two  types  of  responses  to  contaminants: 
structural  and  functional.  Structural  responses  are  typically  associated  with  community- 
level  measures  and  include  estimates  of  abundance,  species  richness,  diversity,  and  com¬ 
munity  composition.  These  measures  have  received  the  most  attention  in  field 
assessments  of  contaminated  sediments.  Functional  measures  generally  include  ecosys¬ 
tem  processes,  such  as  the  rates  of  productivity,  nutrient  cycling,  energy  flow,  and  decom¬ 
position.  Although  there  has  been  recent  interest  in  using  ecosystem-level  endpoints  in 
hazard  assessments,  functional  measures  have  been  largely  ignored  in  assessments  of 
contaminated  sediments.  Hill  (1992)  has  developed  a  relatively  simple  procedure  for 
measuring  community  respiration  in  sediments  and  has  shown  that  changes  in  respira¬ 
tion  rates  are  indicative  of  contaminated  sediments.  The  effects  of  sediment  contami¬ 
nants  on  ecosystem  processes  will  vary  depending  on  the  size  of  the  system.  Pelagic 
processes  dominate  ecosystem  function  in  large  lakes  and  rivers,  whereas  benthic  pro¬ 
cesses  are  generally  more  important  in  smaller  lakes  and  streams  because  of  the  greater 
ratio  of  sediment  to  water  volume  (Reice  and  Wohlenberg  1993). 

There  is  debate  in  the  literature  concerning  whether  structural  or  functional  measures  are 
more  appropriate  for  assessing  ecological  integrity  (Cairns  and  Pratt  1986).  Because  of 
functional  redundancy  of  ecosystems  and  variability  of  ecosystem  responses,  ecosystem 
processes  may  be  relatively  insensitive  to  stress  (Cairns  and  Pratt  1986;  Schindler  1987). 
Despite  the  criticism  of  single-species  toxicity  tests  described  above,  population  re¬ 
sponses  to  contaminants  in  the  field  may  be  very  sensitive  to  stress.  Included  in  Odum’s 
(1992)  top  20  list  of  “great  ideas  in  ecology”  is  the  concept  that  “the  first  signs  of  environ¬ 
mental  stress  usually  occur  at  the  population  level,  affecting  especially  sensitive  species.” 

Structural  and  functional  responses  are  so  intimately  related  that  a  distinction  is  some¬ 
what  artificial.  Indeed,  a  holistic  perspective  of  ecology  requires  that  we  consider  commu¬ 
nities  and  ecosystems  as  more  than  simply  the  sum  of  their  component  populations 
(Webster  1979).  Because  biological  systems  are  arranged  in  a  hierarchical  fashion,  sedi¬ 
ment  contaminants  will  probably  affect  structural  and  functional  responses  simulta¬ 
neously.  Differential  sensitivity  of  benthic  populations  often  results  in  structural  changes 
in  benthic  communities.  These  changes  in  community  composition  may  have  important 
cascading  effects  on  ecosystem  function.  For  example,  Wallace  et  al.  (1982)  reported  that 
reduced  abundance  of  shredders  in  streams  treated  with  insecticides  resulted  in  changes 
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in  organic  matter  processing.  Stewart  and  Hill  (1993)  speculate  that  removal  of  grazing 
snails  had  indirect  effects  on  higher  trophic  levels  and  increased  the  movement  of  toxi¬ 
cants  through  a  lotic  food  web.  Just  as  ecosystem-level  assessments  complement  assess¬ 
ments  at  lower  levels  of  organization  (Suter  1993),  understanding  population  responses 
to  contaminants  provides  information  necessary  to  elucidate  mechanisms  of  changes  at 
higher  levels.  1  suggest  that  assessments  of  sediment  quality  should  include  both  struc¬ 
tural  and  functional  measures  at  several  levels  of  organization. 

9.5  Direct  effects  of  sediment  contaminants  on  benthic 
communities 

Most  research  on  contaminant  effects  in  freshwater  ecosystems  has  focused  on  direct 
effects  (Clements  1991).  In  particular,  shifts  in  community  composition  and  replacement 
of  sensitive  species  by  tolerant  species  have  received  considerable  attention.  For  example, 
we  know  that  certain  groups  of  organisms,  such  as  mayflies,  are  highly  sensitive  to  heavy 
metals,  whereas  other  groups,  such  as  many  caddisflies  and  most  orthoclad  chironomids 
are  quite  tolerant.  Thus,  shifts  in  community  composition  observed  at  metal-polluted 
locations  relative  to  reference  sites  may  be  a  result  of  the  loss  of  sensitive  populations 
(Clements  et  al  1992).  Because  reduced  species  richness  or  diversity  may  result  in  re¬ 
duced  ecosystem  stability  or  trophic  complexity  (Connell  1978;  Pimm  1984),  changes  in 
these  endpoints  are  ecologically  relevant.  Similarly,  reduced  abundance  of  keystone  spe¬ 
cies  may  affect  ecological  integrity.  Keystone  species  have  been  identified  in  several  ma¬ 
rine  ecosystems,  especially  rocky  intertidal  habitats;  however,  considerably  less  research 
has  been  conducted  in  freshwater  systems,  and  few  studies  have  documented  the  role  of 
keystone  species  in  lakes  and  streams.  The  lack  of  research  in  freshwater  ecosystems  does 
not  mean  that  keystone  species  are  absent,  but  rather  demonstrates  the  difficulty  measur¬ 
ing  the  role  of  keystone  species.  Experimental  manipulation  is  the  most  direct  way  to 
demonstrate  the  importance  of  species  in  an  ecosystem.  Experimental  manipulation  is 
inherently  more  difficult  in  lakes  and  streams  than  in  the  rocky  intertidal  zone  where 
species  are  easily  removed  from  an  essentially  two-dimensional  habitat.  Therefore,  docu¬ 
menting  the  role  of  keystone  species  will  be  very  difficult  in  freshwater  ecosystems. 

Direct  effects  of  contaminants  on  ecosystem  function  are  also  ecologically  relevant  and 
should  be  considered  in  assessments  of  contaminated  sediments.  As  noted  above,  there 
is  debate  among  researchers  regarding  the  sensitivity  of  ecosystem-level  endpoints;  how¬ 
ever,  protection  of  ecological  integrity  should  also  include  protection  of  functional  integ¬ 
rity.  Several  functional  endpoints  have  been  used  to  measure  effects  of  contaminants  on 
aquatic  ecosystems,  including  changes  in  primary  and  secondary  productivity,  decompo¬ 
sition  rates,  and  nutrient  cycling  (Rapport  etal.  1985).  Few  studies  have  examined  effects 
of  contaminated  sediments  on  ecosystem  function  in  the  field. 
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9.6  Influence  of  location  and  previous  exposure  on 
responses  to  contaminants 

Because  direct  effects  of  contaminants  on  benthic  communities  may  vary  spatially,  care 
must  be  taken  to  select  endpoints  that  show  similar  responses  among  locations.  Kiffhey 
and  Clements  (1994)  report  that  benthic  communities  from  small,  high  elevation  streams 
are  more  sensitive  to  metals  than  those  from  larger,  low  elevation  streams.  Previous  expo¬ 
sure  to  contaminants  may  also  affect  population-,  community-,  and  ecosystem-level  re¬ 
sponses.  While  acclimation  or  adaptation  of  benthic  populations  to  contaminants  has 
been  measured  (Bryan  and  Hummerstone  1971;  Klerks  and  Weis  1987;  Klerks  and 
Levinton  1989, 1993),  few  studies  have  examined  this  process  at  higher  levels  of  organi¬ 
zation  (Neiderlehner  and  Cairns  1992).  Using  stream  microcosms,  I  have  observed  that 
benthic  communities  from  sites  polluted  with  low  concentrations  of  heavy  metals  were 
more  tolerant  to  metals  than  communities  from  unpolluted  locations.  If  variation  in  sus¬ 
ceptibility  to  contaminants  among  locations  is  a  general  phenomenon,  results  of  these 
studies  have  important  implications  for  developing  SQC. 

9.7  Indirect  effects  of  sediment  contaminants  on  benthic 
communities 

Shifts  in  community  composition  may  also  result  from  indirect  effects  of  contaminants 
on  species  interactions,  such  as  competition  and  predation.  There  is  theoretical  and 
empirical  support  for  the  hypothesis  that  species  interactions  play  a  major  role  in  struc¬ 
turing  freshwater  and  marine  benthic  communities  (Connell  1961;  Dayton  1971, 1975; 
Menge  1976;  Peckarsky  and  Dodson  1980;  Walde  and  Davies  1984;  Walde  1986;  Menge 
and  Sutherland  1987;  Hart  1992).  In  particular,  studies  of  predation  and  competition 
have  received  attention  from  benthic  ecologists  for  many  years,  and  there  is  general 
agreement  among  ecologists  that  these  interactions  are  important  in  some  habitats. 
Again,  much  of  the  experimental  research  demonstrating  direct  effects  of  predation  or 
competition  on  community  structure  has  been  conducted  in  marine  rocky  intertidal 
habitats;  however,  ecologists  have  begun  to  examine  the  role  of  species  interactions  in 
freshwater  benthic  communities  (Peckarsky  and  Dodson  1980;  Walde  and  Davies  1984; 
Walde  1986;  Hart  1992). 

Ecologists  have  long  recognized  the  importance  of  interactions  between  abiotic  factors, 
such  as  environmental  stress,  and  biotic  factors  (Park  1962;  Tilman  1977;  Dunson  and 
Travis  1991).  Surprisingly,  the  effect  of  contaminants  on  biotic  interactions,  such  as  pre¬ 
dation  and  competition,  has  been  largely  ignored  in  ecotoxicology.  Warner  et  al  (1993) 
found  that  interspecific  competition  between  anurans  was  influenced  by  acidification. 
Dunson  and  Travis  (1991)  observed  that  competition  for  food  between  killifish  was  influ¬ 
enced  by  salinity.  Clements  et  al  (1989)  reported  that  net-spinning  caddisflies 
(Trichoptera:  Hydropsychidae)  were  more  susceptible  to  stonefly  (Plecoptera:  Perlidae) 
predation  in  streams  dosed  with  copper  than  in  control  streams.  Finally,  Wipfli  and 
Merritt  (1994)  reported  that  reduced  black  fly  density  in  a  larvicide-treated  stream  al¬ 
tered  species  interactions.  They  concluded  that  community  structure  may  be  indirectly 
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affected  by  applications  of  larvicides.  These  studies  suggest  that  abiotic  factors  may  influ¬ 
ence  the  outcome  of  species  interactions  and  indirectly  alter  community  composition. 

The  effects  of  contaminants  on  species  interactions  will  likely  depend  on  the  relative  sen¬ 
sitivities  of  the  interacting  species  (Clements  et  al  1989;  Arnott  and  Vanni  1993;  Warner 
et  al  1993),  Menge  and  Sutherland  (1987)  developed  a  model  of  community  regulation 
that  predicts  changes  in  the  relative  importance  of  species  interactions  along  environ¬ 
mental  stress  gradients.  According  to  this  model,  disturbance  is  the  most  important  fac¬ 
tor  structuring  communities  at  extreme  levels  of  environmental  stress,  whereas  at 
intermediate  and  low  levels  of  stress,  the  importance  of  competition  and  predation  in¬ 
creases.  If  environmental  contaminants  may  be  considered  a  type  of  environmental 
stress,  then  similar  responses  may  occur  along  contaminant  stress  gradients.  With  re¬ 
spect  to  the  importance  of  predation,  the  specific  outcome  of  environmental  stress  mod¬ 
els  differs  depending  on  whether  consumers  or  their  prey  are  more  susceptible  to  stress 
(Menge  and  Olson  1990).  For  example,  if  predators  are  more  sensitive  to  stress  than  their 
prey,  the  consumer  stress  model  predicts  that  predation  rates  will  be  lower  in  high-stress 
environments.  This  pattern  was  observed  in  rocky  intertidal  habitats  subjected  to  ex¬ 
treme  wave  action  (Menge  and  Olson  1990),  streams  where  the  hydraulic  regime  was 
unfavorable  to  predators  (Peckarsky  etal  1990),  and  in  acidified  lakes  (Locke  and  Sprules 
1994).  In  contrast,  if  prey  are  more  sensitive  to  stress,  the  prey  stress  model  predicts  that 
predation  rates  will  be  greater  in  stressful  environments.  This  pattern  was  observed  in 
the  predator-prey  experiments  between  stoneflies  and  caddisflies  described  above 
(Clements  1994).  Regardless  of  the  direction  of  effects,  these  studies  demonstrate  that  in¬ 
direct  effects  of  contaminants  on  benthic  communities  are  complex  and  may  complicate 
assessments  of  direct  effects. 


One  of  the  most  important  reasons  for  investigating  indirect  effects  is  that  species  inter¬ 
actions  may  actually  be  more  sensitive  to  contaminants  than  direct  effects.  For  example, 
field  and  laboratory  experiments  have  shown  that  filter-feeding  caddisflies  (Trichoptera: 
Hydropsychidae)  are  relatively  insensitive  to  Cu  exposure  (Clements  et  al  1992).  How¬ 
ever,  susceptibility  of  caddisflies  to  predation  by  stoneflies  increased  when  both  groups 
were  exposed  to  metals  (Clements  et  al  1989).  Similar  results  were  reported  by  Kiflfney 
(1995)  for  these  same  groups. 


The  lack  of  studies  on  indirect  effects  of  contaminants  on  species  interactions  is  surpris¬ 
ing,  given  the  prominent  role  that  research  on  species  interactions  has  played  in  aquatic 
ecology.  The  lack  of  research  on  indirect  effects  is  probably  related  to  the  difficulty  of 
designing  and  implementing  field  experiments.  In  the  laboratory,  relatively  complex 
experimental  designs  are  necessary  to  separate  direct  effects  of  contaminants  from  the 
indirect  effects  on  species  interactions.  Separating  these  effects  in  the  field  would  first 
require  demonstrating  that  species  interactions  are  important,  and  then  demonstrating 
that  these  interactions  are  affected  by  contaminants.  Both  tasks  require  experimentally 
manipulating  abundances  of  several  species.  For  example,  by  conducting  caging  experi¬ 
ments  in  reference  and  polluted  locations,  one  could  evaluate  the  effects  of  predation  on 
both  communities.  If  predation  effects  were  greater  in  the  contaminated  sites,  this  would 
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support  the  hypothesis  that  contaminants  increase  the  susceptibility  of  prey  organisms 
to  predation.  Alternatively,  microcosm  experiments  using  communities  obtained  from 
reference  and  polluted  locations  could  be  employed  to  measure  effects  of  contaminants 
on  species  interactions.  In  experimental  streams  I  have  measured  effects  of  stonefly  pre¬ 
dation  on  benthic  invertebrate  communities  obtained  from  reference  and  metal-polluted 
locations.  Preliminary  results  show  that  communities  from  contaminated  sites  were 
more  susceptible  to  stonefly  predation  than  communities  from  reference  sites. 

9.8  Effects  on  higher  trophic  levels 

Alterations  in  benthic  macroinvertebrate  communities  resulting  from  contaminated 
sediment  can  have  negative  effects  on  fish  predators.  These  impacts  occur  either  through 
loss  of  preferred  prey  resources  or  by  food  chain  transfer  of  contaminants.  Although 
numerous  studies  have  measured  transfer  of  contaminants  from  benthic  macroinverte¬ 
brates  to  fish  (Reynoldson  1987;  Dallinger  et  al  1987;  Hatakeyama  and  Yasuno  1987; 
Douben  1989;  Clements  et  al  1994;  Woodward  et  al  1994),  relatively  few  studies  have 
examined  pollution-induced  changes  in  prey  availability  and  predator  feeding  habits 
(Jefree  and  Williams  1980;  Clements  and  Livingston  1983;  Stair  et  al  1984;  Rees  1994). 
Because  many  fish  predators  are  opportunistic  feeders,  it  is  hypothesized  that  pollution- 
induced  changes  in  prey  abundance  will  alter  feeding  habits.  For  example,  differences  in 
macroinvertebrate  abundance  and  community  composition  between  reference  and 
metal-polluted  sites  at  the  Arkansas  River  (a  USEPA  Superfimd  site)  altered  feeding  hab¬ 
its  of  brown  trout  (Salmo  trutta)  collected  from  these  two  sites  (Rees  1994).  Jefree  and 
Williams  (1980)  reported  similar  results  for  the  purple-striped  gudgeon  (Mogurnda 
mogurnda)  and  speculated  that  selective  predation  by  tolerant  predators  on  abundant 
prey  species  in  polluted  habitats  may  dampen  differences  between  reference  and  polluted 
sites.  While  these  studies  demonstrate  that  opportunistic  predators  are  capable  of 
switching  to  more  abundant  prey  resources  at  polluted  locations,  changes  in  feeding 
habits  may  infiuence  predator  energy  budgets.  Optimal  foraging  theory  predicts  that  if  a 
predator  s  diet  is  influenced  by  natural  selection,  animals  will  have  the  greatest  fitness  if 
they  maximize  energy  intake  and  minimize  energy  expenditures  (Werner  and  Hall  1974). 
Although  other  factors,  such  as  risk  of  predation  and  habitat  availability,  also  influence 
predator  fitness,  these  simple  models  suggest  that  any  anthropogenic  changes  in  feeding 
habits  will  affect  a  predator’s  energy  budget.  For  example,  shifts  in  brown  trout  feeding 
habits  observed  at  polluted  sites  in  the  Arkansas  River,  from  relatively  large  organisms 
such  as  mayflies  and  stoneflies  to  small  orthoclad  chironomids,  may  increase  energy  ex¬ 
penditures  and  have  negative  impacts  on  brown  trout  populations  (Rees  1994).  Similar 
responses  are  likely  to  occur  in  other  situations  where  prey  communities  are  altered  by 
sediment  contaminants. 

Bioaccumulation  of  sediment  contaminants  by  benthic  macroinvertebrates  is  frequently 
measured  in  assessments  of  sediment  quality.  Because  benthic  macroinvertebrates  may 
accumulate  contaminants  from  sediments  and  transfer  these  contaminants  to  higher 
trophic  levels,  the  concentration  of  contaminants  in  benthic  organisms  is  ecologically 
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relevant.  Food  chain  transfer  of  sediment  contaminants  from  benthic  macroinvertebrates 
to  fish  and  the  influence  of  food  chain  structure  on  contaminant  levels  in  top  predators 
has  been  considered  (Rasmussen  etal.  1990;  Stewart  and  Hill  1993;  Clements  etal  1994; 
MacDonald  et  al  1994).  There  is  some  controversy  over  the  relative  importance  of  di¬ 
etary  and  aqueous  routes  of  exposure  to  fish,  particularly  for  heavy  metals;  however,  sev¬ 
eral  studies  have  shown  that  dietary  exposure  is  significant  (Hatakeyama  and  Yasuno 
1987;  Douben  1989;  Clements  et  al  1994).  More  importantly,  researchers  have  demon¬ 
strated  significant  effects  on  growth  and  survivorship  of  fish  feeding  on  contaminated 
prey  (Woodward  et  al  1994). 

Because  levels  of  contaminants  in  benthic  organisms  are  often  orders  of  magnitude 
higher  than  those  in  the  water  column,  it  follows  that  food  chain  transfer  may  be  impor¬ 
tant.  Even  for  contaminants  such  as  heavy  metals  where  dietary  transfer  from  benthic 
invertebrates  to  fish  is  relatively  inefficient,  concentrations  in  fish  can  still  reach  harmful 
levels  owing  to  selective  consumption  of  contaminated  prey  (Dallinger  and  Kautzky 
1985;  Dallinger  et  al  1987).  This  “food  chain  effect”  hypothesized  by  Dallinger  et  al 
(1987)  is  influenced  by  mechanisms  of  tolerance  of  prey  populations.  If  tolerant  prey 
compartmentalize  or  sequester  contaminants,  exposure  to  higher  trophic  levels  may  be 
enhanced.  Finally,  because  concentrations  of  contaminants  will  most  likely  vary  signifi¬ 
cantly  among  prey  taxa  (Kiffney  and  Clements  1993),  feeding  habits  and  predator  pref¬ 
erences  of  fish  will  also  influence  contaminant  accumulation  (Rees  1994). 

In  summary,  measurements  of  contaminant  levels  in  benthic  organisms  are  potentially 
useful  measures  of  sediment  quality  if  these  measurements  can  be  linked  to  direct  effects 
on  benthic  organisms  or  to  transfer  to  higher  trophic  levels.  This  latter  step  will  require 
detailed  information  on  prey  abundance,  contaminant  levels  in  prey,  laboratory  studies 
of  transfer  efficiency,  and  feeding  habits  of  predators. 

9.9  Recommendations  for  future  research 

•  Studies  that  integrate  structural  and  functional  responses  at  several  levels  of 
organization  are  necessary.  Studies  conducted  at  lower  levels  of  organization 
will  help  elucidate  mechanisms  responsible  for  changes  at  higher  levels. 

•  More  studies  documenting  the  correspondence  or  lack  of  correspondence 
between  simple  laboratory  toxicity  tests  and  field  assessments  are  necessary. 

•  More  consideration  of  the  indirect  effects  of  contaminants  on  species  interac¬ 
tions  is  necessary.  Assessing  the  importance  of  these  indirect  effects  will  be 
difficult  and  will  require  experimental  manipulation.  There  is  a  rich  body  of 
literature  on  disturbance  theory  and  the  influence  of  disturbance  on  species 
interactions  that  should  be  considered  when  designing  field  studies. 

•  Measuring  bioaccumulation  of  sediment  contaminants  and  food  chain  transfer 
to  higher  trophic  levels  is  an  important  area  of  research.  However,  in  order  to 
improve  the  ecological  relevance  of  these  endpoints,  tissue  concentrations  of 
contaminants  must  be  linked  to  some  estimate  of  ecological  effects. 
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10.1  Introduction 

Many  contaminants  in  fresh  waters  are  insoluble  and  may  attach  to  suspended  particu¬ 
late  material  that  eventually  settles  on  the  bottom  of  aquatic  ecosystems.  As  a  conse¬ 
quence,  sediments  can  act  as  a  sink  for  an  array  of  organic  and  inorganic  contaminants, 
which  can  affect  the  organisms  that  live  in  or  near  the  sediment.  These  organisms  may 
then  mediate  contaminant  transfer  back  to  the  water  column  or  to  other  trophic  levels. 

Ecological  risk  assessment  is  a  process  for  determining  the  probability  of  adverse  ecologi¬ 
cal  effects  in  an  ecosystem  resulting  from  exposure  to  stressors  (USEPA 1992;  Parkhurst 
et  al.  1994;  Calow  1995).  Ecological  risk  assessment  is  currently  the  focus  of  several  inten¬ 
sive  research  programs,  and  methods  are  being  developed  for  collecting  chemical  and 
biological  data  for  inclusion  in  mathematical  models  used  by  environmental  managers 
{e.g.,  Burns  et  al  1994). 

Ecological  risk  assessment  of  sediment  contamination  is  a  subdiscipline  of  ERA.  Sedi¬ 
ment  ecological  risk  assessment  has  been  applied  to  ponds,  embayments,  the  bottom 
layers  of  lakes  and  estuaries,  and  the  depositional  areas  of  streams  and  rivers  {e.g., 
Canfield  et  al  1994;  Pastorok  et  al  1994).  As  with  many  other  toxicological  approaches 
used  in  fresh  waters,  SERAs  frequently  focus  on  henthic  invertebrates  (or  benthos).  These 
organisms  are  used  for  the  following  reasons  (La  Point  and  Fairchild  1992;  Rosenberg 
and  Resh  1993a;  Metcalfe-Smith  1994;  Davis  1995;  Resh  1995):  1)  they  have  been  used 
historically  in  assessing  environmental  degradation;  2)  they  are  widespread  and  can  be 
affected  by  environmental  perturbations  in  many  different  types  of  freshwater  systems; 
3)  they  live  on  and  in  the  segments  and  so  are  directly  associated  with  chemical  contami¬ 
nants;  4)  the  presence  of  a  large  number  of  species  offers  a  spectrum  of  responses  to  en¬ 
vironmental  stress;  5)  their  sedentary  nature  allows  effective  spatial  analysis  of  pollutant 
or  disturbance  effects;  6)  they  have  life  cycles  of  months  or  years,  and  so  can  act  as  con¬ 
tinuous  monitors  of  environmental  quality;  and  7)  their  responses  can  be  quantified  in  a 
manner  that  can  be  understood  by  managers,  regulators,  and  the  general  public. 

Most  SERAs  attempt  to  measure  the  abundance  and  richness  of  the  benthos  in  specific 
areas  of  contamination,  and  then  to  compare  the  results  to  those  obtained  at  control  or 
reference  sites  (Canfield  et  al  1994;  Pastorok  et  al  1994;  Barbour  et  al  1995).  Much  has 
been  written  to  describe  the  collection,  identification,  and  analysis  of  benthic  data  for 
biological  monitoring  of  aquatic  ecosystems  (for  reviews,  see  Hellawell  1986;  Plafkin  et  al 
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1989;  Rosenberg  and  Resh  1993b;  Metcalfe-Smith  1994).  However,  despite  numerous 
studies  documenting  alterations  of  benthic  invertebrate  communities  as  a  result  of  envi¬ 
ronmental  stress  (e.g.,  Rosenberg  and  Wiens  1976;  Winner  etal.  1980;  Krieger  1984;  La 
Point  etui.  1984;  Clements  et ul.  1988;  Canfield  et ul.  1994),  the  success  of  correctly  assess- 
ing  the  health  or  degradation  of  these  communities  depends  on  how  well  the  responses 
caused  by  contamination  can  be  discriminated  from  responses  caused  by  other  environ¬ 
mental  factors  (Dunson  and  Travis  1991;  Hughes  1995). 

Environmental  assessments  may  confuse  natural  variability  with  environmental  degrada¬ 
tion  because  a  thorough  understanding  of  the  many  natural  factors  that  can  influence  or 
regulate  variability  is  lacking  (Landis  et  al  1994).  Reynoldson  (1984)  and  France  (1990) 
have  previously  cautioned  that  the  macrobenthos  can  respond  to  seemingly  minor 
changes  in  substrate  particle  size,  organic  content,  texture,  and  water  quality  as  well  as  to 
the  presence  of  contaminants.  Furthermore,  spatial  heterogeneity  in  depositional  areas 
can  be  high,  which  requires  large  numbers  of  sampling  unit  replicates  to  distinguish  be¬ 
tween  natural  variability  and  anthropogenic  perturbation.  For  example,  an  SERA  of  the 
Upper  Clark  Fork  River  in  Montana  (Canfield  etal  1994)  recognized  that  increased  num¬ 
bers  of  Chironomidae  and  Oligochaeta,  and  a  predominance  of  metal-tolerant  species  in 
metal-contaminated  sediments,  suggested  an  imbalanced  benthic  community;  however, 
they  also  noted  that  factors  such  as  differences  in  habitat  and,  perhaps,  intermittent 
physical  disturbances  could  account  for  the  observed  community  structure.  Differences 
in  habitat  type  are  sources  of  spatial  heterogeneity;  intermittent  disturbances  are  sources 
of  temporal  heterogeneity  (Townsend  and  Hildrew  1994).  An  investigation  of  acidified 
rivers  and  lakes  throughout  the  province  of  Ontario  (Gibbons  and  Mackie  1991)  revealed 
that  reproductive  output  of  the  amphipod  Hyalella  azteca  could  be  correlated  with  a  num¬ 
ber  of  environmental  variables  {e.g.,  sulphate,  calcium  hardness,  sediment  particle  size, 
seston,  and  organic  matter  of  the  fine  sediment)  in  addition  to  the  variable  of  interest, 
decreased  pH.  Thus,  it  is  important  in  the  SERA  process  to  adequately  describe  the 
benthic  communities  that  are  being  sampled  and  to  understand  the  natural  environmen¬ 
tal  factors  affecting  various  habitats  located  within  assessment  and  reference  areas. 

The  objectives  of  this  paper  are  1)  to  describe  environmental  variables  that  are  important 
in  structuring  natural  benthic  invertebrate  communities  in  rivers  and  lakes,  2)  to  address 
the  influence  of  spatial  and  temporal  scales  on  habitat  characteristics,  3)  to  discuss  ap¬ 
proaches  that  may  be  useful  in  differentiating  between  natural  variability  and  anthropo¬ 
genic  stress  in  SERA,  and  4)  to  identify  implications  for  SERA  and  future  research  needs. 

10.2  RiverS/  lakes,  and  scales 

Communities  of  organisms  respond  to  and  are  structured  by  an  array  of  abiotic  and  bi¬ 
otic  variables.  The  significance  of  these  variables  to  SERAs  depends  on  the  location  of  the 
variables  (i.e.,  in  lakes  or  rivers)  and  on  the  spatial  and  temporal  scales  of  interest.  For 
example,  rivers  are  oriented  horizontally  {e.g„  Vannote  et  al  1980;  Johnson  et  al  1995), 
whereas  lakes  are  oriented  vertically  (Ryder  and  Pesendorfer  1989).  Rivers  provide  con¬ 
nections  for  transfer  of  materials,  whereas  lakes  provide  storage. 
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Morphology  (e.g,,  gradient,  width,  depth,  substrate  type),  hydrodynamics,  and  tempera¬ 
ture  together  form  the  principal  components  of  the  abiotic  milieu  of  streams.  In  contrast, 
morphology  (e.g.,  depth,  area,  volume)  and  climate  determine  the  seasonal  hydrodynam¬ 
ics  of  lakes  (e.g.,  spring  and  fall  overturns,  and  wave  dynamics  of  north-temperate  lakes). 
Most  of  the  energy  flow  in  lakes  begins  with  autochthonous  production  (photosynthesis), 
which  is  in  contrast  to  allochthonous  production  that  occurs  in  most  parts  of  rivers. 
Other  differences  in  abiotic  and  biotic  variables  of  rivers  and  lakes  are  summarized  in 
Table  10-1. 

A  close  coupling  exists  between  abiotic  and  biotic  variables  in  fresh  waters  and  their  ef¬ 
fects  on  sediment  contamination  (Table  10-1).  The  sediments  of  lakes  and  rivers  harbor 
the  organisms  of  interest  to  SERA,  so  the  different  effects  of  abiotic  and  biotic  variables 
on  sediment  contamination  are  of  interest  here.  For  example,  mean  particle  size  in  rivers 
decreases  in  a  downstream  direction,  whereas  in  lakes  it  decreases  with  depth.  In  both 
cases,  mean  particle  size  affects  both  processes  within  sediments,  and  the  movement  of 
sediments  and  contaminants  into  the  water  column.  Floating  plants  are  limited  to  slow- 
flowing  reaches  of  rivers,  whereas  they  are  restricted  to  sheltered  bays  of  lakes.  Floating 
plants  may  directly  affect  the  transport  of  contaminants  to  sediments  when  they  die, 
settle,  and  decay.  Many  other  examples  of  such  interactions  appear  in  Table  10-1. 

The  spatial-temporal  organization  of  rivers  and  lakes  has  special  relevance  to  SERA,  and 
the  variables  being  examined  in  an  SERA  must  be  appropriate  to  the  spatial  and  tempo¬ 
ral  scales  chosen  for  analysis.  It  is  important  to  note  that  those  scales  selected  for  exami¬ 
nation  are  artificial  extracts  from  what  is  a  natural  continuum.  Furthermore,  factors  that 
affect  benthic  invertebrates  in  rivers  and  lakes  at  different,  arbitrarily  chosen,  spatial  and 
temporal  scales  can  act  simultaneously  (Resh  and  Rosenberg  1989). 

10.2.1  Rivers 

Physical  factors  operating  at  a  variety  of  spatial  and  temporal  scales  ultimately  create  the 
plethora  of  riverine  environments.  Frissell  et  al  (1986)  have  identified  a  number  of  spa¬ 
tial  scales  at  which  river  systems  can  be  examined;  these  scales  occur  in  an  hierarchy 
ranging  from  the  entire  stream  system  (catchment)  to  the  microhabitat  level  (Figure 
10-1).  Physical  characteristics  at  any  given  scale  generally  are  determined  from  the  scales 
above  in  the  hierarchy  (Townsend  and  Hildrew  1994).  That  is,  aquatic  systems  at  progres¬ 
sively  larger  scales  in  the  hierarchy  are  the  result  of  progressively  larger-scale  and  less- 
frequent  geophysical  events.  However,  it  is  the  small-scale  processes  such  as  erosion  and 
deposition  that  ultimately  produce  the  habitats  normally  investigated  (see  below),  and  it 
is  these  processes  that  impinge  directly  on  individual  organisms.  To  explain  this  phenom¬ 
enon,  we  have  selected  several  scales  in  an  hypothetical  river  system. 

The  catchment  of  this  hypothetical  river  covers  about  250,000  km^  and  it  has  been  in 
existence  for  a  little  more  than  100,000  years.  The  physical  forces  responsible  for  its 
present  form  include  the  recent  geology  of  the  area,  its  glacial  history,  and  long-term  rain¬ 
fall  patterns. 
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Table  10-1  Differences  in  some  physical  and  biological  attributes  of  lotic  and  lentic  systems  and  their 
relationship  to  sediment  contamination 


Attributes 

Lotic  systems 

Lentic  systems 

Activities  in  relation  to 
sediment  contamination 

Erosion/deposition  ratio 

Decreases  downstream 

Decreases  from  nearshore 
littoral  to  greater  depths 

1,3 

Shoreline  erosion 

Extensive,  induced  by 
water  currents 

Localized,  induced  by 
wind-driven  waves 

1 

Mean  particle  size 

Decreases  downstream 

Decreases  with  depth 

2,3 

Number  of 
substrate  types 

Increases  downstream 

Determined  by 
geomorphology  and  wave 
action 

2 

Distribution  of  substrate 

Determined  by  wave 
currents;  gravity  driven 

Determined  by  wind- 
induced  currents,  geology, 
and  geography 

1, 2,3 

Breadth/shoreline 
length  ratio 

Low 

High 

1 

Watershed/surface 
area  ratio 

High 

Low 

1 

Ice  scouring  effects 

Robust,  extensive 

Localized  to  windward, 
near-shore  littoral 

3,4 

Flow  characteristics 

Unidirectional, 

horizontal 

Three-dimensional 

3,4 

Current 

Gravitational  movement, 
decreases  vertically 

Wind-induced, 

convectional 

3.4 

Flow  diversions 

Common,  habitat 
implications 

Rare 

1,3 

Water-level  fluctuations 

Flooding 

Minor  variations 

1,3,4 

Flooding  effects  on  biota 

Traumatic;  reset  event 

Diminished 

1.3,4 

Groundwater/surface 
drainage  ratio 
(summer) 

High  ratio  decreases 
temperature;  low  ratio 
increases  temperature 

Significant  only  in  seepage 
lakes;  effect  same  as  in 
rivers 

2.3 

Oxygen  content 

Usually  high 

Variable 

2. 3, 4? 

Diurnal  variation 
of  dissolved  O2 

Dependent  upon 
photosynthesis  and 
respiration  rates,  and 
decomposition 

Not  significant 

2,3 

Phosphorus 

Variable,  headwaters  to 
mouth 

High  in  spring  and  at 
overturns 

2 

Total  dissolved  solids 

Increase  downstream 

Temporal  cycle 

2, 3? 

Turbidity 

High;  varies  among  rivers  Low;  varies  among  lakes 
and  spatially  and  temporally 

3 
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Table  10-1  continued 


Activities  in  relation  to 

Attributes 

Lotic  systems 

Lentic  systems 

sediment  contamination 

Rate  of  nutrient  influx 

Governed  by  terrestrial 

Determined  by  internal 

1,2,3 

vegetation,  flooding 

cycling  rates 

Speed  of  nutrient  transit 

Rapid;  determined  by 

Slow;  determined  by  wind 

1,2 

gradient  and  current 

and  waves,  and  overturn 

Nutrient  retention 

Low 

High 

2,3 

Mineral  uptake  rates 

Rapid 

Low 

2,3 

Floral  distribution 

Reliant  upon  nutrients. 

Dependent  on  depth. 

2 

current 

substrate 

Phytoplankton 

abundance 

Low;  greatest  in 
intermediate  or  high 
orders;  limited  by 
turbidity,  turbulence 

High 

1,2 

Bryophytes/lichens 

Prevalent;  fast-flowing 
areas;  shade  tolerant; 
some  limited  by  hard 

Scarce 

1, 2, 4? 

water 

Macrophyte  adaptations  Few;  restricted  to 

Many 

1, 2, 4? 

Floating  plants 

running  waters 

Limited  to  slow-flowing 

Restricted  to  sheltered 

1? 

reaches;  prevalent  in 
tropical  rivers 

bays 

Primary  fish  forage 

Invertebrates 

Forage  fishes 

2,3 

Major  energy  source 

Detritus 

Solar  radiation 

1,2 

Nutrient  regime 

Spiraling 

Cycling 

2 

Reset  events 

Floods  (severe);  ice 
scouring 

Overturn  (moderate) 

1,2, 3,4 

Resilience 

High 

Low 

2,4 

Community  structure 

Sequential  (seasonal); 

Harmonic;  successional 

1,2, 3, 4 

successional  (spatial); 

(temporal);  mainly 

mainly  stochastic 

deterministic 

Community  refiigium 

Boundary  layer; 

Ecotone;  shallow. 

1,2? 

hyporheic  zone  Q.e., 
deep,  inhabitable 

inhabitable  substrate 

substrate) 

Source:  Attributes  and  diiFerences  from  tables  1-8  of  Ryder  and  Pesendorfer  (1989)  Can  Spec  Publ  Fish  Aquat  Sci 
106:65-85.  Reprinted  by  permission. 

“Activities”  column  added:  1  =  Direct  effects  on  transport  of  contaminants  to  sediments;  2  =  Effects  on  processes  within 
sediments;  3  =  Movement  of  sediments  and  contaminants  into  water  column;  4  =  Removal  of  contaminants  from 
sediments;  ?  expresses  uncertainty 
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Figure  10-1  Spatial  and  temporal  scales  in  rivers:  catchment,  reach,  and  microhabitat  levels 


If  the  scale  is  reduced  several  orders  of  magnitude  to  a  stream  reach,  this  consists  of  a 
riffle-pool  sequence.  The  reach  occupies  an  area  of  -500  m^  Its  present  form  was  deter¬ 
mined  by  the  stream  gradient  in  the  area  and  the  last  100-year  flood. 

Reducing  the  scale  to  the  microhabitat  level,  a  single  rock  sitting  in  the  riffle  of  the  stream 
reach  is  considered.  The  top  of  the  rock  covers  an  area  of  «100  cm^  The  rock  has  been 
there  since  the  last  major  freshet  produced  enough  tractive  force  to  roll  it  from  elsewhere 
to  its  present  position  (e.g.,  within  weeks).  As  long  as  this  rock  is  stable,  it  serves  as  poten¬ 
tial  habitat  for  benthic  invertebrates. 

Next  we  describe  the  factors  that  affect  benthic  invertebrates  at  the  three  spatial-tempo¬ 
ral  riverine  scales  chosen. 

1)  Catchment  level.  Several  authors  have  used  multivariate  statistical  approaches 
to  examine  factors  that  regulate  lotic  macroinvertebrates  over  large  geographic 
scales.  Ormerod  and  Edwards  (1987)  examined  a  small  catchment  (4200  km^) 
in  Wales;  Corkum  and  Currie  (1987)  and  Corkum  (1989)  examined  -100  rivers 
in  Alberta,  British  Columbia,  the  Yukon,  and  Alaska,  and  Wright  etal  (1984, 
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1988)  studied  rivers  in  all  of  Great  Britain.  Each  of  these  studies  discovered  a 
number  of  variables  that  were  correlated  with  benthic  invertebrate  community 
structure  (Table  10-2).  Little  concordance  is  evident  among  the  variables  in  the 
studies,  partly  because  of  the  different  geographic  areas  and  partly  because  of 
differences  in  the  variables  measured. 

Recently,  we  used  a  multivariate  approach  to  design  a  biomonitoring  program 
based  on  benthic  invertebrates,  for  the  Fraser  River  catchment  in  British 
Columbia  (unpublished  data).  The  program  uses  physical  and  chemical 
variables  collected  along  with  benthic  invertebrates  from  250  reference  sites 
spread  through  the  catchment.  The  aim  of  the  program  is  to  create  a  model, 
based  on  reference  conditions,  that  uses  conservative  physical  and  chemical 
variables  to  predict  the  community  structure  of  macroinvertebrates.  Predicted 
community  structure  at  impacted  sites  may  then  be  compared  to  actual  benthic 
communities,  and  the  divergence  would  indicate  the  extent  of  remediation 
required.  The  method  is  an  adaptation  of  the  River  InVertebrate  Prediction  And 
Classification  System  (RIVPACS)  developed  in  Great  Britain  by  Wright  et  al 
(1984, 1988)  and  the  BEnthic  Assessment  of  SedimenT  (BEAST)  model  devel¬ 
oped  for  the  Great  Lakes  by  Reynoldson  et  al  (1995).  These  large-scale  models, 
of  necessity,  do  not  identify  what  has  caused  the  benthic  invertebrate  commu¬ 
nity  to  deviate  from  reference  conditions;  that  information  comes  from  smaller- 
scale  experimental  work. 

2)  Reach  level.  A  number  of  factors  influence  benthic  invertebrates  at  this  level, 
which  is  the  most-studied  scale  in  rivers,  including  substrate  stability,  discharge, 
temperature,  and  food  availability  (Resh  and  Rosenberg  1984).  For  example,  the 
stability  of  substrate  at  a  given  discharge  will  determine  the  numbers  of 
invertebrates  present  in  a  stretch  of  stream  {e.g.,  Cobb  et  al  1992).  Invertebrate 
numbers  may  be  reduced  during  a  spate  in  patches  of  small,  unstable  gravel  or 
shale,  whereas  at  the  same  discharge  they  are  not  affected  in  patches  of  larger, 
more  stable  substrate.  Densities  and  diversities  of  invertebrates  in  affected 
reaches  can  recover  quickly  to  pre-spate  levels  during  periods  of  low  flow  (Cobb 
etal  1992). 

Changes  in  the  above  factors  that  result  from  anthropogenic  activities  at  the 
reach  level  can  also  affect  benthic  invertebrates.  For  example,  extensive 
clearcutting  of  forests  may  result  in  increased  discharge  that  moves  substrate  at 
unexpected  times  of  the  year.  If  the  natural  capacity  of  the  system  to  absorb 
disturbance  is  exceeded,  the  ability  of  the  benthic  invertebrate  community  to 
recover  may  be  substantially  altered.  Another  example  may  be  dams  that  release 
water  much  colder  than  can  be  tolerated  by  downstream  benthic  fauna  histori¬ 
cally  adapted  to  warmer  water  {e.g.,  Lehmkuhl  1972).  Such  types  of  abiotic 
factors  must  be  considered  alongside  chemical  stressors  in  SERA. 

But  what  about  the  pool  part  of  the  riffle-pool  sequence?  In  general,  the  combi¬ 
nation  of  subhabitats  collectively  referred  to  as  a  pool  has  received  less  study 
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Table  10-2  Summary  of  catchment-level  (and  larger)  variables  that  correlate  with  benthic  invertebrate 
_ community  structure  in  lotic  systems 


Variable 

United  Kingdom  ^ 

Pacific  Northwest^ 

South  Wales^ 

Distance  from  source 

+ 

+++ 

+-H- 

Slope 

-K-f 

+ 

+++ 

Latitude 

— 

•HH- 

Altitude 

+ 

+ 

+++ 

Discharge  category 

-H-f 

_ 

+ 

Geology 

— 

+++ 

Vegetation  cover 

— 

-I-I-+ 

_ 

Physiography 

— 

+ 

_ 

Land  use 

— 

•HH- 

— 

Stream  order 

_ 

+ 

Mean  channel  width 

-l-H- 

+ 

Depth  category 

+ 

_ 

Substrate  heterogeneity 

+ 

— 

— 

Date 

+ 

+-H- 

Water  width 

+ 

+++ 

Water  depth 

-t- 

+ 

+++ 

Surface  velocity 

+++ 

+ 

Mean  substrate 

+++ 

+ 

+ 

Dominant  particle  size 

+++ 

_ 

%  macrophyte  cover 

+++ 

+ 

_ 

Overhanging  vegetation 

— 

+++ 

— 

pH 

+++ 

-1- 

+++ 

Oxygen 

+ 

_ 

Nitrate 

-1- 

_ 

+++ 

Chloride 

+++ 

_ 

Phosphate 

-m- 

_ 

+ 

Alkalinity 

-H-f 

_ 

+++ 

Conductivity 

— 

+++ 

+++ 

^  Source:  Wright  etaL  1984. 

^  Sources:  Corkum  and  Currie  1987;  Corkum  1989. 

^  Source:  Ormerod  and  Edwards  1987. 

+  =  variable  measured;  +++  =  variable  correlated  with  invertebrate  community  structure 


than  riffles.  However,  pools  are  depositional  habitats  and  fine  sediments  collect 
in  them,  so  they  can  be  places  where  contaminants  concentrate  in  streams. 
Likely,  pools  and  their  invertebrate  fauna  behave  more  like  lacustrine  habitats 
of  a  similar  spatial  and  temporal  scale  than  do  riffles. 

3)  Microhabitat  level.  Two  major  factors  affect  the  fauna  at  this  spatial  and 
temporal  scale:  a)  tractive  force  of  water  flow  on  a  rock,  which  determines 
whether  the  rock  is  stable  and  provides  suitable  habitat  or  whether  it  is  mobile 
and  discourages  colonization  by  stream  invertebrates;  and  b)  the  hydraulic 
regime  of  water  flowing  over  a  rock,  which  determines  the  species  present  and 
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where  they  are  located  on  the  rock  {e.g.,  Newbury  1984;  Statzner  et  al.  1988).  For 
example,  the  filter-feeding  caddisfly  Brachycentrus  occidentalis  typically  occurs 
on  the  top  of  a  cobble  as  the  flow  accelerates  to  pass  over  the  obstruction  posed 
by  the  cobble  {e.g.  Wetmore  etal  1990;  Figure  10-2).  Here,  the  streamlines 
contract  as  flow  accelerates  so  that  an  organism  extending  its  filtration  appara¬ 
tus  up  into  the  flow  will  intercept  a  high  proportion  of  suspended  material.  A 
filter  feeder  in  this  position  would  have  access  to  higher  rates  of  food  delivery 
than  if  it  were  to  occur  in  the  deeper,  uniform  flow  area  just  approaching  the 
cobble.  Distribution  of  another  filter  feeder,  such  as  larvae  of  the  biting  black  fly 
Simulium  vittatum,  can  be  similar  to  B.  occidentalis  for  the  same  reason.  In 
conducting  SERAs,  this  example  demonstrates  the  importance  of  sampling 
similar  microhabitats  in  reference  and  affected  areas  to  avoid  confusing 
microhabitat  effects  with  contaminant  effects. 

Sediment  ecological  risk  assessment  is  normally  done  at  scales  above  the  microhabitat 
level,  but  knowledge  at  the  microhabitat  level  is  required  to  provide  a  mechanistic  under¬ 
standing  of  the  effects  of  disturbance  at  these  larger  scales.  For  example,  consider  a 
stream  reach  beside  a  cobble-bed  stream  that  is  cleared  for  farming,  crops  are  grown,  and 
pesticides  are  applied  in  the  normal  course  of  farming  practice.  During  heavy  rainfall,  soil 
erosion  occurs  and  organic  particles  coated  with  pesticide  are  washed  into  the  stream. 
The  filter-feeding  community  is  then  at  risk  from  these  contaminants.  This  result  may 
not  be  considered  detrimental  if  Simulium  black  fly  larvae  are  killed  (and  biting  adults 
are  not  produced),  but  if  other  non-pest  species  like  B.  occidentalis  are  affected,  a  signifi¬ 
cant  portion  of  the  energy  processing  capability  of  the  stream  could  be  impacted.  Tempo¬ 
rally,  the  threat  to  filter  feeders  would  persist  until  the  soil  erosion  event  stops,  the  next 
spate  rearranges  the  stream  substrate,  or  the  insect  pupates  and  emerges  as  an  adult. 

The  stream  benthic  community  that  is  sampled  in  the  normal  course  of  an  SERA  on  this 
hypothetical  stream  reach  is  different  from  the  community  occurring  before  farming  was 
initiated  and  also  differs  from  that  occurring  in  a  reference  stream.  For  the  above  ex¬ 
ample,  it  is  clear  that  a  microhabitat-level  understanding  can  explain  the  mechanisms 
behind  a  specific  response;  such  elucidation  then  permits  the  development  of  a  manage¬ 
ment  plan  to  ameliorate  these  effects. 

10.2.2  Lakes 

No  widely  used  hierarchical  classification  system  exists  for  lentic  waters  that  is  similar  to 
Frissell  et  al’s  (1986)  system  for  lotic  waters.  For  comparison,  we  propose  a  similar 
scheme,  based  upon  a  regional  lake  system.  An  hypothetical  regional  lake  catchment 
covering  about  500,000  km^  is  shown  in  Figure  10-3.  The  geological  foundation  for  the 
catchment  was  set  in  the  Precambrian  period,  some  three  billion  years  ago,  but  the 
present  system  of  lakes  (and  streams)  has  existed  for  =10,000  years,  since  the  last  major 
glaciation.  As  with  the  hypothetical  river  system  previously  described,  major  physical 
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Figure  10-2  Local  flow  profile  for  larval  habitat  of  the  caddisfly  Brachycentrus  occidentalis  in  Wilson 

Creek,  Manitoba.  Measurements  of  water-surface  elevation,  bed  elevation,  and  mean  flow 
velocity  taken  at  stations  marked  ■.  Heavy  arrow  shows  position  of  insect  larva. 
Streamlines  (dashed)  are  adjacent  to  larval  location  to  illustrate  local  zone  of  flow 
convergence  and  acceleration.  Source:  Wetmore  etal.  1990.  Reprinted  by  permission  of 
the  Journal  of  the  North  American  Benthological  Society. 


disturbances  and  climate  change  operating  on  a  global  scale  and  over  long  periods  of 
time  have  structured  the  basic  environment  of  this  system. 

Next,  the  scale  is  reduced  several  orders  of  magnitude  to  a  bay  on  one  of  the  lakes.  It  oc¬ 
cupies  an  area  of  about  10  km^  and  was  formerly  a  glacial  meltwater  channel  whose  out¬ 
flow  was  blocked  some  7000  years  ago. 

The  scale  can  again  be  reduced  several  orders  of  magnitude  to  a  patch  of  fine-grained 
sediment  at  the  marshy  end  of  the  bay.  The  patch  covers  an  area  of  1  m^  and  it  contains 
particulate  organic  material,  which  mostly  originated  from  the  last  phytoplankton  bloom 
30  d  before)  and  which  provides  food  for  a  diverse  assemblage  of  invertebrates  such 
as  chironomid  larvae,  mayfly  nymphs,  sphaeriid  molluscs,  and  oligochaetes.  The  sedi¬ 
ment  patch  may  be  altered  by  freshets  in  the  creek  that  drains  into  the  marshy  end  of  the 
bay  or  by  seasonal  storm  events  that  are  strong  enough  to  produce  waves  that  rework  the 
bottom  sediments  of  the  bay. 

The  factors  affecting  benthic  invertebrates  at  the  three  spatial-temporal  scales  established 
are  as  follows: 

1)  Regional  level.  Johnson  and  Wiederholm  (1989)  and  Reynoldson  et  al  (1995) 
used  multivariate  approaches  to  examine  factors  that  affect  benthic  invertebrate 
community  structure  in  lakes  over  large  geographic  areas  (Table  10-3).  Six 
groups  of  lakes  were  identified  from  the  68  oligo-mesohumic  lakes  in  Sweden 
examined  by  Johnson  and  Wiederholm  (1989),  based  on  invertebrate  species 
composition.  Three  of  these  groups  were  most  influenced  by  low  pH,  high  80^"=, 
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Figure  10-3  Spatial  and  temporal  scales  in  lakes:  regional,  bay,  and  sediment-patch  levels 


high  temperature,  and  high  phytoplankton  biovolumes.  Two  more  were 
associated  with  high  pH  and  low  total  phosphorus  (TP)  concentrations.  The  last 
group  consisted  of  only  one  lake,  from  which  only  two  taxa  of  macroinverte¬ 
brates  were  collected.  The  best  predictors  of  community  structure  for  all  groups 
were  depth,  silica,  bicarbonate  (alkalinity),  phytoplankton  production,  and  pH. 
Five  species  groups  were  identified  from  the  96  sites  examined  by  Reynoldson  et 
al.  (1995)  in  the  Laurentian  Great  Lakes.  Of  the  25  environmental  variables  they 
examined,  the  best  water-column  predictors  of  occurrence  in  a  group  were 
depth,  NOj  -nitrogen,  alkalinity,  and  pH.  The  most  relevant  sediment  variables 
were  percent  silt,  organic  content  (measured  as  %  loss  on  ignition),  and  concen¬ 
trations  of  AP^  and  Na"^. 

Other  studies  of  benthic  invertebrate  communities  in  lakes  have  used  a  multiple 
regression  approach  to  relate  biomass  of  benthic  invertebrates  to  environmen¬ 
tal  variables  (Hanson  and  Peters  1984;  Rasmussen  and  Kalff  1987;  Table  10-3). 
Hanson  and  Peters  (1984)  found  that  TP  was  the  best  univariate  predictor  of 
biomass  in  the  38  lakes  of  their  study;  combining  TP  and  lake  area  slightly 


SET AC  Press 


146 


ECOLOGICAL  RISK  ASSESSMENT  OF  CONTAMINATED  SEDIMENTS 


improved  the  relationship.  The  best  model  from  Rasmussen  and  Kalff ’s  (1987) 
study  of  131  lakes  involved  chlorophyll  concentration,  depth,  slope,  water  color, 
and  temperature.  Both  of  the  above  studies  included  lakes  from  several  conti¬ 
nents. 

The  results  of  these  four  studies  suggest  that  relatively  simple  and  consistent 
variables  can  be  used  to  predict  benthic  invertebrate  community  structure  or 
biomass  in  lakes.  Depth  was  an  important  predictor  in  all  four  studies,  phy¬ 
toplankton  production  (chlorophyll)  was  important  in  three,  and  pH,  tempera¬ 
ture,  phosphorus,  and  nitrogen  were  important  in  two  (Table  10-3). 

The  BEAST  model  developed  by  Reynoldson  etal  (1995)  for  the  Laurentian 
Great  Lakes  (described  above)  proposes  a  multivariate  approach  for  assessing 
sediment  quality.  The  model  has  already  been  used  to  measure  the  extent  of 
sediment  contamination  (compared  to  reference  conditions)  in  a  metal- 
contaminated  harbor  and  to  recommend  remediation  in  certain  parts  of  the 
harbor. 

2)  Bay  level.  The  hypothetical  bay  is  long  and  narrow  and  is  joined  to  the  main 
body  of  the  lake  by  a  shallow  inlet  {e.g.,  see  Bodaly  and  Lesack  1984).  Therefore, 
it  receives  most  of  its  water  and  nutrients  from  the  main  lake  only  when  water 
levels  are  high  enough  to  flow  over  the  inlet,  as  for  example  during  spring  high- 
water  events.  This  situation  has  existed  since  the  bay  was  formed,  some  7000 
years  ago.  A  small  creek  flows  into  the  opposite  end  of  the  bay;  it  also  provides 
nutrients  and  fine  sediments.  The  creek  end  of  the  bay  is  a  marshy  area,  which 
supports  the  highest  benthic  invertebrate  abundances  in  the  bay.  The  shoreline 
around  the  rest  of  the  bay  is  composed  of  a  variety  of  substrate  types,  most  of 
which  are  either  bedrock  or  sand  and  silt.  This  variety  of  shoreline  type  pro¬ 
duces  a  spatially  disjunct  distribution  of  habitats  suitable  for  benthic  inverte¬ 
brates.  Leaf  litter  inputs  in  the  autumn  are  important  sources  of  energy  because 
the  bay  is  long  and  narrow  {i.e.,  the  ratio  of  terrestrial  areaibay  surface  area  is 
high),  and  it  is  oligotrophic, 

A  contaminant  spill  into  the  bay  would  have  negative,  long-term  consequences 
for  the  benthic  invertebrate  community  because  water  renewal  and  sediment 
retention  times  are  so  long.  In  a  worst-case  scenario,  prevailing  winds  could 
move  the  contaminant  to  the  marshy  end  of  the  bay  where  benthic  populations 
would  be  most  affected;  perhaps  the  contaminant  {e.g.,  oil)  would  persist  there 
over  time.  Although  remediation  and  assessment  will  be  undertaken  at  this 
spatial  scale,  we  shall  see  below  that  biological  responses  occur  at  a  smaller 
scale. 

3)  Sediment  patch  level.  Three  major  factors  determine  the  occurrence  and 
abundance  of  benthic  invertebrates  in  a  fine-grained  sediment  patch  at  the 
marshy  end  of  the  bay:  a)  food  of  sufficient  quality  and  quantity  {i.e.,  productiv¬ 
ity),  b)  tolerable  temperature  extremes,  and  c)  sufficient  dissolved  oxygen.  The 
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Table  10-3  Summary  of  catchment-level  (and  larger)  variables  that  correlate  with  benthic  invertebrate 
community  structure  in  lentic  systems 


Lentic  system 

Variable 

Laurentian  Great 
Lakes^ 

Sweden^ 

38  lakes  on 
various  continents^ 

131  lakes  on 
various  continents'* 

Latitude 

-m- 

— 

— 

— 

Geology 

+ 

— 

_ 

— 

Vegetation  cover 

+ 

— 

— 

— 

Physiography 

+ 

— 

— 

— 

Land  use 

+ 

— 

— 

— 

Surface  area 

_ 

_ 

-HH- 

+ 

Slope 

— 

_ 

— 

-HH- 

Substrate 

+ 

— 

— 

— 

Date 

+ 

+ 

— 

— 

Water  depth 

-HH- 

+++ 

-HH- 

-HH- 

Mean  substrate 

+ 

_ 

— 

— 

Dominant  particle  size 

4-H- 

— 

— 

— - 

Sediment  chemistry 

-H-f 

— 

— 

— 

pH 

-m- 

-HH- 

— 

— 

Oxygen 

+ 

-!- 

— 

— 

Nitrogen  (TN,  nitrate) 

+++ 

-HH- 

— 

— 

Chloride 

-1- 

— 

— 

— 

Sulphate 

— 

-H-E 

— 

— 

Phosphorus  (TP,  phosphate) 

+ 

-HH- 

-HH- 

Alkalinity 

-HH- 

+ 

— 

~ 

Conductivity 

— 

— 

— 

+ 

Temperature 

+ 

+-H- 

— 

-HH- 

Color 

— 

+ 

— 

+++ 

Major  ions 

+ 

-HH- 

— 

— 

Water  clarity 

— 

+ 

— 

+ 

Phytoplankton  production 

— 

-HH- 

■HH- 

-HH- 

Zooplankton  production 
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effect  of  these  factors  on  the  occurrence  and  abundance  of  Chironomidae  and 
Oiigochaeta  is  illustrated  in  Figure  10-4. 

As  with  the  stream  example,  investigation  at  this  smallest  scale  can  provide  a  mechanistic 
understanding  of  events  that  occur  at  larger  scales.  For  example,  if  a  lumbering  operation 
is  started  in  the  catchment  of  the  bay  and  the  bay  is  eventually  clear  cut,  normal  seasonal 
rainfall  washes  sediment  and  organic  matter  into  the  bay;  nutrient  levels  then  increase. 
Algal  production  is  enhanced  and  the  rise  in  organic  matter  deposited  in  the  sediment 
increases  bacterial  oxygen  demand  in  the  sediment.  By  late  summer,  when  maximum 
water  temperatures  occur,  the  waters  over  this  sediment  patch  become  anoxic.  Thus,  a 
sediment  patch  that  was  formerly  dominated  by  a  diverse  array  of  oxygen-sensitive  inver¬ 
tebrates  shifts  to  one  dominated  by  tubificid  oligochaetes  that  tolerate  anoxia.  (In  the 
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event  of  a  coincident  oil  spill,  effects  caused  by  anoxia  must  be  separately  discerned  from 
those  caused  by  chemical  contamination).  The  altered  community  will  probably  persist 
until  sufficient  reforestation  occurs  to  prevent  any  further  soil  erosion  and  organic  matter 
inputs  to  the  bay.  The  invertebrates  that  formerly  inhabited  the  sediment  patch  may  then 
recolonize  it.  Note,  again,  that  processes  affecting  benthic  communities  at  this  spatial 
scale  are  a  composite  of  events  occurring  at  even  smaller  spatial  scales  {e.g.,  particle  inges¬ 
tion,  fluxes  across  biological  membranes).  As  discussed  for  rivers  and  streams,  an  under¬ 
standing  of  these  smaller  scale  processes  is  critical  to  the  development  of  remediation 
plans  for  lentic  environments. 

In  summary,  spatial  and  temporal  variability  are  characteristic  of  aquatic  ecosystems  and 
must  be  accounted  for  in  the  design  of  SERAs,  the  elucidation  of  results  obtained,  and  the 
extrapolation  of  study  conclusions  to  management  options.  Such  variability  involves 
populations  and  communities,  can  occur  in  individual  habitats  or  at  ecosystem  scales, 
and  can  operate  at  time  scales  ranging  from  hours  to  years.  Such  variability  also  should 
be  expected  by  investigators  trying  to  determine  perturbation-induced  changes  in 
aquatic  ecosystems. 

The  most  likely  scale  of  investigation  for  SERA  will  be  the  reach  level  in  lotic  ecosystems 
and  a  bay  (or  part  of  a  lake)  in  lentic  ecosystems.  The  environmental  factors  important  in 
structuring  these  environments  operate  at  the  next  highest  scale,  but  the  most  appropri¬ 
ate  sampling  unit  will  be  the  next  lowest  scale  {e.g,  the  cobble  substrate  in  a  stream  or  the 
fine  sediment  patch  in  a  lake).  Although  studies  at  the  smallest  spatial  and  temporal 
scales  are  usually  inappropriate  for  SERA,  it  is  critical  to  understand  processes  that  oper¬ 
ate  at  these  scales  because  they  form  the  underlying  biological  fabric  for  benthic  commu¬ 
nities.  After  all,  it  is  at  these  scales  that  community  structure  begins. 

10.3  Identifying  anthropogenic  impact 

The  success  of  any  SERA  lies  in  the  ability  of  the  methods  used  to  distinguish  between 
impacted  and  unimpacted  conditions.  Measures  of  both  functional  and  structural 
change  have  been  used  to  identify  anthropogenic  impacts.  The  sediment  quality  triad 
approach  (Chapman  etal  1991, 1992)  integrates  data  from  physical  and  chemical  analy¬ 
ses,  laboratory  exposure  to  whole  sediments,  and  benthic  invertebrate  community  struc¬ 
ture  to  determine  effects.  The  nature  of  the  impact  of  concern  should  determine  the 
measure  used.  If  the  concern  is  related  to  contaminant  accumulation  in  vertebrates,  then 
it  is  necessary  to  understand  the  uptake  of  contaminants  from  sediments  and  their  move¬ 
ment  through  the  food  web  (see  Luoma  and  Fisher,  Chapter  14).  If  the  concern  is  one  of 
general  environmental  degradation,  then  measurement  of  change  in  the  structure  of 
benthic  communities  is  appropriate. 

An  array  of  methods  is  available  for  community  structure  assessment.  Usually,  the  mea¬ 
sured  attributes  are  based  on  taxa  richness,  enumerations,  or  some  integrative  measure 
of  the  two  ie,g,  diversity  or  similarity  indices;  Resh  and  Jackson  1993).  Univariate  ap¬ 
proaches  are  most  frequently  used  (Norris  and  Georges  1993)  and  can  be  quantitative 
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(e.g.,  hypothesis  setting,  rigorous  statistical  design,  replication;  Resh  and  McElravy  1993) 
or  qualitative  {Le.,  rapid  assessment  procedures;  Resh  and  Jackson  1993).  These  methods 
usually  compare  control  sites  to  impacted  sites  by  inference. 

The  use  of  rapid  assessment  procedures  as  an  alternative  to  traditional  quantitative  ap¬ 
proaches  has  been  embraced  by  regulatory  agencies  in  the  United  States  (Davis  and 
Simon  1995).  These  methods  (Plafkin  et  al  1989;  Resh  et  al.  1995)  involve  reduced  sam¬ 
pling  and  identification  costs  compared  to  quantitative  studies  using  benthic  inverte¬ 
brates.  Reference  community  types  are  often  based  on  an  ecoregion  approach  (Omernik 
1995).  In  most  rapid  assessment  programs,  several  different  measures  of  benthic  commu¬ 
nities  {le.,  the  “multimetric”  approach)  are  used  in  evaluations.  The  best  measurements 
to  use  in  rapid  assessment  programs  are  discussed  in  Resh  and  Jackson  (1993)  and  Resh 
etal  (1995). 

Multivariate  approaches  can  be  divided  into  two  types:  1)  multiple  regression  and  2) 
paired  matrices.  Multiple  regression  develops  relationships  between  benthic  inverte¬ 
brates  and  environmental  attributes  but  considers  only  univariate  measures  of  commu¬ 
nity  structure.  The  paired  matrices  approach  tries  to  identify  pattern  and  structure  in  the 
benthic  invertebrate  community  and  then  relate  that  to  environmental  variables  {e.g., 
Wright  et  al  1984;  Corkum  and  Currie  1987;  Johnson  and  Wiederholm  1989).  The  pur¬ 
pose  of  this  approach  is  to  establish  predictive  models  of  communities  that  can  be  used 
to  assess  environmental  impairment.  If  the  predictive  models  are  based  on  reference 
conditions,  then  the  models  derived  can  be  used  to  measure  divergence  from  the  refer¬ 
ence  state  and  the  degree  of  risk  associated  with  an  environmental  stressor.  For  example, 
the  approach  developed  by  Reynoldson  et  al  (1995)  has  allowed  the  setting  of  appropri¬ 
ate,  site-specific  and  biologically  based  sediment-quality  objectives  for  the  Laurentian 
Great  Lakes  using  easy-to-measure  habitat  characteristics.  Future  bioassessment  pro¬ 
grams  will  probably  combine  elements  of  rapid  assessment  and  multivariate  methods 
(Resh  etal  1995). 

10.4  Implications  for  sediment  ecological  risk  assessment 

What  are  the  critical  needs  for  doing  SFRA  in  rivers  and  lakes?  First,  the  appropriate  spa¬ 
tial  and  temporal  scales  for  the  assessment  must  be  determined.  Three  discrete  spatial- 
temporal  scales  were  presented  above  for  a  hypothetical  river  and  lake  system,  but  in 
reality  spatial  and  temporal  scales  both  comprise  a  continuum  of  conditions.  The  scales 
chosen  ultimately  will  depend  on  the  questions  being  asked  and  the  requirements  of 
environmental  managers  and  decision  makers.  The  factors  being  measured  need  to 
match  the  scales  chosen;  mismatches  probably  will  not  provide  answers  to  the  questions 
being  asked.  As  the  scales  increase,  the  risk  assessment  process  becomes  more  difficult; 
for  example,  increasing  geographic  size  of  the  study  area  will  result  in  changes  in  species 
composition  and  will  require  more  habitats  to  be  sampled.  At  the  same  time,  better 
methods  are  needed  to  assess  the  influence  of  spatial  and  temporal  variations  in  structure 
and  function  of  benthic  communities  on  both  accumulation  and  effects  of  sediment- 
based  contaminants. 
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Second,  it  is  important  to  understand  the  environmental  factors  that  affect  benthic  inver¬ 
tebrate  communities  at  the  spatial  and  temporal  scales  chosen  and  the  natural  variabil¬ 
ity  of  these  factors.  Natural  variability  may  decrease  in  moving  from  larger  to  smaller 
scales,  but  certainly  this  is  not  always  true.  The  impacts  of  anthropogenic  stresses  can 
only  be  understood  in  the  context  of  deviations  from  natural  variability.  The  establish¬ 
ment  of  regional  reference  bases  for  benthic  invertebrate  community  structure,  either 
through  a  biocriteria  approach  (e.g.,  Davis  and  Simon  1995)  or  through  multivariate  ap¬ 
proaches  (e.g.,  Wright  et  al  1984, 1988;  Reynoldson  et  al  1995),  allow  the  effects  of  an¬ 
thropogenic  impacts  to  be  determined  over  wide  geographic  areas. 

Third,  bioassessment  methods  traditionally  have  used  univariate  approaches.  This  may 
have  been  suitable  for  the  severe  organic  pollution  problems  characteristic  of  the  first  half 
of  the  twentieth  century  in  developed  and  industrialized  countries,  but  the  many  con¬ 
founding  factors  associated  with  modern-day  multiple  inputs  of  toxicant  mixtures  may 
call  for  the  increased  use  of  multivariate  approaches. 

10.5  Future  research  needs 

A  key  future  need  is  that  a  constant  proportion  of  funding  in  risk  assessment  research 
should  be  directed  to  questions  involving  basic  systematic  and  ecological  research.  The 
perilous  state  of  contemporary  systematics  and  recommendations  for  its  resurrection  are 
outlined  by  Wheeler  (1995).  Systematics  underlies  our  understanding  of  the  natural 
world  because  species  are  the  basic  units  in  that  world.  Knowledge  of  species  is  funda¬ 
mental  to  biomonitoring  and  SERA. 

The  need  for  applied  freshwater  ecology  is  obvious,  but  it  is  impossible  to  apply  knowl¬ 
edge  that  does  not  exist  (Johnson  etal  1993).  An  understanding  of  benthic  invertebrate 
ecology  is  a  prerequisite  to  implementing  a  biological  approach  to  ecosystem  manage¬ 
ment,  as  practiced  in  SERA. 

Last,  it  is  important  to  determine  the  degree  to  which  experimental  results  derived  at  one 
spatial-temporal  scale  can  be  extrapolated  to  other  scales  {e.g,  see  Fee  and  Hecky  1992; 
Fee  et  al  1992;  Turner  et  al  1995).  Continued  support  of  long-term  field  research  facili¬ 
ties  that  have  access  to  lakes  and  rivers  of  different  sizes  is  essential  to  this  need. 
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11.1  Introduction 

Equilibrium  models  for  the  dynamics  of  ecological  systems  are  often  used  as  a  framework 
in  which  to  place  the  goals  of  ERA.  There  is  a  growing  body  of  evidence  that  equilibrium 
models  are  not  good  descriptions  of  ecological  events  and  that  alternatives  do  exist.  Non¬ 
equilibrium  models  are  powerful  tools  in  explaining  and  predicting  events  across  both 
aquatic  and  terrestrial  systems  (Reice  1994).  Given  tools  derived  from  conventional  sta¬ 
tistics  (Johnson  1988;  Kersting  1988)  and  machine  learning  (Matthews,  Matthews,  and 
Hachmoller  1991;  Matthews,  Matthews,  and  Ehinger  1991;  Matthews  et  al  1995a, 
1995b;  Landis  et  al  1994),  it  is  now  possible  to  detect  differences  in  noisy  ecological  sys¬ 
tems.  Are  these  differences  significant?  If  the  detectable  differences  are  representations 
of  the  potential  for  ecological  systems  to  react  differently  to  a  subsequent  stressor  event, 
then  the  differences  are  biologically  and  ecologically  significant.  The  sections  below 
present  current  definitions  of  recovery,  data  that  indicate  the  persistence  of  information 
concerning  stressor  events  and  the  heterogeneity  of  sediment,  and  a  nonequilibrium 
hypothesis  for  the  reaction  of  a  sediment  system  to  a  stressor  event.  We  also  propose  the 
replacement  of  the  term  recover  in  environmental  toxicology  with  restructure  or  reconstruct 
to  more  accurately  reflect  the  process  undergoing  investigation  or  prediction. 

1 1.2  Definitions  of  recovery 

Recovery  is  often  stated  as  a  goal  or  desired  state  in  ERA.  As  often  defined,  recovery  is  the 
return  to  a  state  so  that  the  assessment  endpoints  are  of  values  not  statistically  different 
compared  to  those  before  the  stressor  event.  Another  common  definition  is  that  the  im¬ 
pacted  system  has  moved  to  a  state  not  statistically  distinguishable  from  the  surrounding 
environmental  mosaic  as  exempfified  by  a  reference  site.  Both  of  these  types  of  defini¬ 
tions  are  dependent  on  the  finding  of  no  statistically  significant  or  no  projected  signifi¬ 
cant  difference  between  the  site  of  interest  and  the  reference  site  or  state.  Therefore,  these 
definitions  depend  upon  opinion  or  upon  the  available  statistical  tools  and  their  power. 
A  more  precise  definition  is  as  follows:  Return  after  a  disturbance  to  a  state  that  reacts  to 
subsequent  events  as  if  the  initial  disturbance  had  not  occurred. 

This  type  of  definition  recognizes  that  for  recovery  to  occur,  the  information  about  the 
prior  stressor  event  needs  to  be  erased  from  the  system.  In  effect,  the  system  is  ahistorical 
(see  Lewontin  1969).  However,  many  lines  of  evidence  suggest  that  ecological  systems  are 
historical  and  are  by  definition  complex  systems.  If  ecological  systems  are  historical,  then 
recovery  as  defined  above  cannot  occur. 
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11.3  Persistence  of  information  in  ecological  systems 

Using  tools  derived  from  machine  learning,  detectable  differences  in  ecological  structures 
subjected  to  different  stressor  regimens  have  been  found  to  be  persistent.  Streams 
(Matthews,  Matthews,  and  Hachmoller  1991),  microcosm  experiments  (Landis, 
Matthews,  Markiewicz,  Shough,  and  Matthews  1993;  Landis,  Matthews,  Markiewicz, 
and  Matthews  1993;  Landis  etal  1994;  Landis,  Matthews,  Markiewicz,  and  Matthews 
1995),  and  biomarkers  in  vole  populations  (Fairbrother  et  al,  in  press),  all  have  demon¬ 
strated  that  detectable  differences  are  the  rule.  Even  as  differences  disappear  at  certain 
times,  they  can  reappear  in  a  treatment  related  fashion.  Other  laboratory  research  on  the 
structure  and  dynamics  of  ecological  systems  (Drake  1991;  Drake  etal  1993)  have  recog¬ 
nized  the  persistence  of  information.  Patterns  abound  in  systems,  and  historical  events 
are  written  into  the  structure  and  dynamics  of  ecological  structures.  Ecological  systems 
are  complex  by  definition  in  the  terminology  of  Nicolis  and  Prigogine  (1989).  Complex 
systems  and  ecological  systems  (Brooks  et  al  1989)  have  a  common  property  of  being 
irreversible. 

Our  research  group  (Landis,  Matthews,  Markiewicz,  Shough,  and  Matthews  1993; 
Landis,  Matthews,  Markiewicz,  and  Matthews  1993;  Landis  et  al  1994;  Landis, 
Matthews,  Markiewicz,  and  Matthews  1995)  has  also  described  the  persistence  of  infor¬ 
mation  within  even  simple  model  ecological  systems,  i.e.,  the  standardized  aquatic  micro¬ 
cosm  (SAM)  (ASTM  1991)  and  the  mixed  flask  culture  (MFC)  (Leffler  1984;  Stay  et  al 
1988, 1989)  when  dosed  with  the  water-soluble  fraction  of  jet  fuels.  The  64-d  SAM  pro¬ 
tocol  is  comprised  of  10  algal,  3  invertebrate,  and  1  bacterial  species  introduced  into  3  L 
of  sterile,  chemically  defined  medium.  Four  treatment  groups  are  comprised  of  six  repli¬ 
cates.  The  MFC  uses  naturally  occurring  assemblages  of  aquatic  organisms  that  are  col¬ 
lected  from  local  streams  and  lakes,  brought  back  to  the  laboratory  and  allowed  to 
reassemble  and  restructure  during  a  3-month  equilibration  period.  Subsamples  are  then 
used  to  inoculate  1  L  experimental  vessels,  and  a  cross-  and  re-inoculation  procedure  is 
used  to  attempt  homogeneity  among  the  experimental  replicates.  As  with  the  SAM  ex¬ 
periments,  4  treatments  with  6  replicates  are  used. 

We  use  tools  derived  from  artificial  intelligence  (Al)  and  machine  learning  research 
(Matthews  et  al  1995a,  1995b)  to  look  for  patterns  within  the  diverse  dataset  typical  of 
ecological  experiments.  Typical  analytical  methods,  such  as  analysis  of  variance 
(ANOVA),  are  generally  not  able  to  effectively  combine  data  as  disparate  as  dichotomous 
sequence  presence/absence  with  continuous  data  such  as  pH,  algal  counts,  and  soil  res¬ 
piration.  These  kinds  of  data  are  characteristic  of  problems  dealt  with  traditionally  by  AL 
Handling  the  visual  system  of  a  robot,  for  instance,  in  a  room  full  of  tools,  debris,  light 
and  shadows,  is  a  classic  problem  of  analyzing  large  quantities  of  “dirty”  data  into  mean¬ 
ingful  categories.  In  previous  work,  we  have  shown  that  environmental  datasets  are  ame¬ 
nable  to  Al  techniques  (Matthews,  Matthews,  and  Hachmoller  1991;  Matthews, 
Matthews,  and  Ehinger  1991;  Landis,  Matthews,  Markiewicz,  Shough,  and  Matthews 
1993;  Landis,  Matthews,  Markiewicz,  and  Matthews  1993;  Landis  etal  1994). 
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In  these  sets  of  experiments  (Landis,  Matthews,  Markiewicz,  Shough,  and  Matthews 
1993;  Landis,  Matthews,  Markiewicz,  and  Matthews  1993;  Landis,  Matthews, 
Markiewicz,  and  Matthews  1995),  our  AI  and  other  analysis  tools  filtered  a  complex 
dataset  to  reveal  patterns  that  went  unnoticed  by  unaided  human  ecologists.  We  discov¬ 
ered  a  persistent  relationship  between  the  treatment  group  and  the  clusters  within  the 
data  over  the  course  of  these  experiments.  The  effects  were  persistent  beyond  the  detec¬ 
tion  of  toxicant  within  the  experimental  systems.  These  results  have  led  to  the  develop¬ 
ment  of  the  community  conditioning  hypothesis  by  Matthews  etal  (1996).  These  model 
ecological  systems  and  analysis  techniques  have  been  adapted  to  sediment  microcosms. 

11.4  Sediment  microcosms 

The  experiments  described  above  were  carried  out  with  the  jet  fuel  delivered  to  the  mi¬ 
crocosm  system  as  a  percentage  of  a  water  soluble  fraction.  However,  Sandberg  (1993) 
performed  MFC  microcosm  experiments  that  injected  jet  fuel  toxicant  into  the  sediment 
component  of  a  mixed-flask-culture-type  microcosm.  Four  treatment  groups  were  used. 
Unlike  the  conventional  jet  fuel  experiments,  the  toxicant  would  reappear  within  the 
water  column  after  stirring  was  conducted  to  sample  the  various  organisms.  Cosine  and 
vector  distances  and  nonmetric  clustering  coupled  with  association  analysis  were  used  to 
examine  the  dataset  (Landis  etal  1994).  It  was  found  that  the  effects  of  the  introduction 
of  the  fuel  were  observable  during  the  first  21  days  after  dosing,  were  not  seen  during 
days  21  through  35,  then  reappeared  and  persisted  during  the  course  of  the  two-month- 
long  experiment  (Figure  11-1).  In  other  words,  the  information  related  to  treatment  type 
remained  within  the  model  ecological  systems. 

11.5  Heterogeneity  of  sediments  in  time  and  space 

Unlike  laboratory  experiments  that  strive  to  enhance  the  uniformity  of  the  environment, 
sediments  are  heterogeneous  in  space  and  time.  A  study  by  Wiegers  (1994)  sampled  a 
typical  freshwater  pond,  Claypit  Pond,  that  was  known  to  be  contaminated  with  heavy 
metals.  Fifteen  cores  were  taken,  and  each  5  cm  was  analyzed  for  metals  using  several 
extraction  techniques.  Presented  in  Table  11-1  are  the  variance-to-mean  ratios  of  the  15 
sample  sites  presented  by  depth.  A  variance-to-mean  ratio  of  1  is  indicative  of  a  normal 
distribution.  A  variance-to-mean  ratio  of  greater  than  1  indicates  a  contagious  or 
clumped  distribution.  A  ratio  of  less  than  1  indicates  an  even  distribution  (Elliott  1971). 
As  can  be  seen  in  Table  11-1,  the  variance-to-mean  ratio  for  the  metals  studied  is  generally 
greater  than  1  for  chromium,  lead,  and  zinc.  Assuming  that  the  greater  depths  were  in¬ 
dicative  of  past  conditions,  it  can  also  be  seen  that  this  ratio  has  varied  greatly.  A  vari¬ 
ance-to-mean  ratio  as  high  as  53  for  chromium  was  observed  in  these  sediment  cores. 

Such  a  degree  of  heterogeneity  indicates  that  Claypit  Pond,  and  other  sediment  environ¬ 
ments,  are  patchy  environments.  The  heterogeneity  in  the  environment  and  the  distur¬ 
bance  regimen  is  likely  to  be  responsible  for  the  biotic  structure  of  the  ecological  system 
(Reice  1994).  Nonequilibrium  factors  are  then  critical  for  the  maintenance  of  the  struc¬ 
ture,  a  property  of  ecological  systems  often  used  as  an  assessment  endpoint  in  ERA. 
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Functional  Parameters 


Figure  11-1  Clustering  and  association  analysis  of  the  sediment  mixed-flask  culture  functional  data.  For 
most  of  the  period  of  the  experiment,  clustering  demonstrated  a  statistically  significant 
association  with  treatment.  Horizontal  dashed  line  denotes  95%  confidence  level.  Analysis 
variables  include  pH,  measurements  of  dissolved  oxygen,  and  absorbence  of  light.  A 
parallel  analysis  with  biotic  (structural  parameters)  showed  a  similar  pattern.  During  days 
21  through  35,  none  of  the  clustering  methods  identified  clusters  associated  with 
treatment:  however,  clusters  associated  with  treatment  reappeared  after  day  35. 

Not  only  is  the  chemical  composition  of  a  sediment  heterogeneous,  so  is  the  distribution 
of  organisms.  The  protistan  species  of  the  Paramecium  aurelia  andP.  bursaria  complexes 
usually  exhibit  highly  contagious  distributions  (Landis  1981, 1982),  as  do  many  other 
benthic  organisms. 

11.6  A  hypothesis  called  ^^community  conditioning^^ 

Our  results  and  the  repeated  confirmation  of  the  heterogeneity  of  ecological  systems  have 
led  to  our  proposing  the  “community  conditioning  hypothesis”:  ecological  systems  tend 
to  preserve  information  about  every  event  in  their  etiology  (Matthews  et  al  1996).  The 
information  can  be  held  in  the  interaction  among  the  organisms  within  a  community,  in 
the  frequency  of  genetic  markers  within  a  population,  or  in  the  pattern  of  biomarkers 
(Landis  et  al  1996).  The  corollaries  to  this  hypothesis  are  these: 

1)  Communities  are  a  product  of  their  unique  etiology,  which  is  the  historical 
collection  of  physical,  chemical,  and  biological  events  leading  up  to  a  point  in 
time.  No  two  ecological  systems  will  ever  be  identical. 

2)  Events  that  alter  the  structure  or  function  of  populations  within  the  community 
become  a  part  of  the  history  of  the  community  and  are  difficult  to  erase.  The 
influence  of  the  event  may  increase  or  decrease  over  time,  but  it  is  not  lost  from 
the  history  of  the  community. 

3)  Information  can  be  stored  in  an  uncataloged  array  of  biotic  and  abiotic  forms, 
such  as  varieties  of  detritus,  phenotypic  fitness,  sediment  composition,  or  the 
genetic  structure  of  constituent  populations.  Any  subset  of  community  mea- 
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Table  11-1  Heterogeneity  of  sediments:  variance  divided  by  mean  sediment  concentration 
in  Claypit  Pond 


Variance-to-mean  sediment  concentration  ratio  (pg/g)^ 


Digestion 

Pond  depth 
(cm)’’ 

Chromium 

Copper‘d 

Lead 

Zinc 

Strong  acid 

0-5 

5.6 

1.1 

7.4 

4.8 

5-10 

16 

1.5 

9.3 

6.1 

10-15 

53.6 

1.6 

8.2 

8.7 

15-20 

22.1 

0.7 

13.6 

5.4 

20-25 

1.5 

1.2 

3.3 

2.4 

Acetic  acid 

0-5 

1.2 

0.3 

0.06 

6.1 

5-10 

4.5 

0.2 

0.3 

13.6 

10-15 

6.8 

0.2 

0.02 

13.5 

15-20 

6.7 

0.6 

1.01 

7.6 

20-25 

0.7 

0.7 

0 

1.2 

^  Often  variance-to-mean  ratio  is  greater  than  1,  indicating  that  contaminant  distribution  is  highly  clumped . 

Ratios  also  vary  with  depth  of  sediment,  indicating  that  distribution  of  metal  contaminant  varies  over  time. 

Cores  taken  from  15  sites  within  the  pond. 

Note  that  variance-to-mean  ratio  for  copper  is  less  than  1  for  both  digestion  techniques,  indicating  even  distribution 
of  this  metal. 


surements,  such  as  single-species  population  counts  or  reproduction  dynamics, 
cannot  be  assumed  to  be  representative  of  the  entire  community. 

4)  Information  may  be  retained  by  properties  of  the  community  that  remain 
hidden,  unmeasured  or  unmeasurable,  for  indefinite  time  periods.  The  poten¬ 
tial  of  this  conditioning  to  alter  the  future  trajectory  of  the  community  may 
remain  undiminished. 

5)  Almost  all  environmental  events  can  leave  lasting  effects.  “No  observed  effect” 
does  not  imply  “no  effect.” 

The  characteristic  of  ecological  systems  to  preserve  information  about  their  etiology 
poses  an  interesting  set  of  considerations.  Given  that  no  two  ecological  systems  will  have 
identical  etiologies,  similarities  instead  of  differences  should  perhaps  be  the  emphasis.  A 
general  assumption  is  made  that  if  two  systems  are  different,  the  cause  is  proximate. 
Given  different  etiologies,  the  cause  may  be  either  proximate  or  due  to  historical  events 
widely  separated  in  time.  It  is  difficult  to  know  which. 

11.7  Differences  and  similarities 

One  of  the  implications  of  the  improvement  in  our  ability  to  recognize  differences  in  eco¬ 
logical  systems  is  that  differences  are  the  rule  rather  than  the  exception.  The  recognition 
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of  the  nonequilibrium  nature  of  ecological  systems  and  the  incorporation  of  information 
about  etiologies  state  that  no  two  systems  will  be  the  same.  Turning  the  question  around, 
it  is  perhaps  more  sensible  to  measure  similarities  of  the  systems.  We  are  currently  ex¬ 
ploring  the  application  of  AI  to  the  measurement  of  similarity  with  both  static  and  dy¬ 
namic  data.  Initial  results  are  encouraging.  Initial  analysis,  using  the  similarity 
measurement  program  RIGGLE,  confirms  the  finding  that  Treatment  1  (non-dosed)  mi¬ 
crocosms  are  most  similar  to  the  treated  replicates  within  an  experiment  compared  to 
Treatment  1  replicates  of  other  experiments.  Analyses  using  generated  data  also  illumi¬ 
nate  the  ability  of  the  technique  to  measure  similarities  statistically  and  dynamically. 
Comparisons  of  static  and  dynamic  similarity  can  demonstrate  out-of-step  dynamics  and 
other  characteristics  of  the  dynamics  of  populations. 

11.8  Implications  for  sediment  risk  assessment 

Future  ecological  assessments  can  be  set  so  that  similarity  scores  are  the  determining 
factor.  Underlying  assumptions,  which  mirror  the  reality  of  the  complex  nature  of  eco¬ 
logical  structures,  are  that  no  two  systems  are  identical  and  that  responses  to  a  future 
stressor  may  be  widely  divergent.  If  long-term  management  of  ecological  structures  is  to 
be  a  national  goal,  and  a  mix  of  multiple  stressors  a  long-term  reality,  then  an  under¬ 
standing  of  these  basic  tenets  will  be  essential  to  accurate  and  useful  assessments. 

As  ecological  systems  incorporate  historical  information,  regulatory  decisions  and  poli¬ 
cies  must  be  based  on  the  knowledge  that  toxic  insults,  as  well  as  other  disturbances,  will 
have  effects.  At  a  particular  instance  in  time,  we  may  not  be  able  to  obtain  evidence  of  the 
effects  by  measuring  a  subset  of  variables  within  the  community.  Detection  of  effects  will 
depend  upon  the  nature  of  the  effect  and  our  ability  to  predict  its  time  and  form  of  ap¬ 
pearance.  What  is  required  is  the  regulatory  courage  to  say  which  effects  we  will  choose 
to  look  for  in  the  community,  which  effects  will  be  allowed,  and  which  will  not.  Relying 
upon  the  fallacies  of  “no  effect”  or  “no  observable  effect”  is  unsound  and  potentially  di¬ 
sastrous  if  implemented  as  policy.  The  realization  must  come  about  that  unacceptable 
effects  will  occur,  even  without  anthropogenic  inputs.  Systems  change,  all  of  the  time, 
and  our  actions  are  an  inevitable  part  of  that  change. 

11.9  Terminology 

The  term  recovery  has  been  used  to  mean  a  variety  of  dynamics  and  outcomes,  both  in 
ecology  and  environmental  toxicology.  Given  that  ecological  systems  are  nonequilibrium 
structures,  it  makes  no  sense  to  use  a  term  that  has  its  basis  in  classical  stability-recovery 
dynamics.  After  all,  to  recover  a  typical  sediment-based  ecological  system,  it  will  be  nec¬ 
essary  to  recreate  the  layers  of  heterogeneity  and  dynamic  diversity  that  characterize  a 
complex,  irreversible  system  (Landis,  Matthews,  and  Matthews  1995).  Restructure  or  re¬ 
construct  may  be  more  accurate  descriptions  of  the  process  and  the  predictions  of  ecologi¬ 
cal  risk  assessment  and  have  been  suggested  as  possible  replacements  for  recovery  (Reice 
1994). 
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Why  all  the  bother  over  semantics  and  terminology?  The  answer  is  straightforward.  The 
use  of  terminology  such  as  recovery  also  brings  with  it  the  recovery-stability  model  of  the 
ecology  of  the  1920s  to  1970s  (with  notable  exceptions).  No  matter  how  it  may  be  opera¬ 
tionally  defined,  most  investigators  and  regulators  bring  with  the  term  recovery  the 
recovery-stability  metaphor  with  all  of  its  misconceptions.  A  similar  problem  exists  with 
the  use  of  ecosystem  health  as  a  metaphor  for  the  status  of  an  ecological  system  (Suter 
1993).  The  use  of  new  terminology  to  more  accurately  reflect  the  nonequilibrium  dynam¬ 
ics  of  ecological  systems  should  better  define  the  processes  and  goals  of  ecological  and 
sediment  risk  assessment. 
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12.1  Introduction 

Ecological  relevance,  in  ERA,  can  take  on  a  number  of  definitions  depending  upon  scien¬ 
tific,  societal,  or  administrative  perspectives. 

The  scientific  perspective  in  ERA  involves  the  study  of  basic  ecological  principles  to  provide 
an  understanding  of  how  ecosystems  function,  the  development  of  methods  to  detect  the 
effects  of  contaminants  on  the  integrity  of  these  ecosystems,  and  determination  of  the 
adequacy  of  predictive  powers  to  determine  how  ecosystems  will  function  when  stressors 
are  added  or  removed.  In  the  strictest  scientific  sense,  detectable  changes  in  structure  or 
function  are  ecologically  relevant  and  may  indicate  environmental  risks.  These  changes 
may  occur  within  the  bounds  of  natural  variability  and  be  benign,  or  they  could  be  more 
extreme  and  result  in  collapse  of  the  ecosystem.  The  detection,  measurement,  and  predic¬ 
tion  of  ecological  change  falls  into  the  realm  of  science,  whereas  the  degree  of  allowable 
change  is  not  only  a  scientific  problem  but  also  a  societal  and  administrative  problem. 

Societal  perspectives  involve  aesthetic,  recreational,  and  economic  valuations  of  a  resource 
that  is  important  to  a  stakeholder.  It  is  difficult  to  define  ecological  relevance  from  a  so¬ 
cietal  perspective  because  many  stakeholders  have  their  own  individual  valuation  system, 
so  items  of  importance  may  vary  from  conservation  of  a  backyard  stream  to  conservation 
of  a  National  Forest  Reserve,  and  from  maintenance  of  a  viable  fishery  to  protection  of  all 
species  within  an  ecosystem. 

The  administrative  perspective  involves  enforcement  of  regulations  and  statutory  man¬ 
dates  such  as  the  CWA  in  the  U.S.  or  the  Fisheries  Act  in  Canada.  In  a  managerial  context, 
ERA  is  necessary  for  new  product  assessment,  navigation  dredging,  or  site  cleanup.  Eco¬ 
logical  relevance  in  this  context  again  depends  upon  the  various  concerns  of  the  stake¬ 
holders  and  what  legislative  mandates  exist  to  protect  these  concerns. 

Ideally,  assessment  endpoints  in  the  ERA  framework  (USEPA  1992)  are  created  for  a 
given  problem  only  after  the  three  above  contexts  have  been  evaluated  and  integrated.  A 
variety  of  measurement  activities  can  then  be  assigned  based  on  a  common  assessment 
endpoint  (see  Chapter  7).  For  example,  it  may  be  desirable  for  an  industry  to  determine 
the  potential  for  contamination  of  sediments  downstream  from  their  outfall  by  any  prod- 
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uct  or  by-product  that  results  from  the  manufacturing  process.  An  administrative  (regu¬ 
latory)  context  may  require  that  the  sediment  in  the  stream  be  supportive  of  “fishable, 
swimmable  water.”  A  societal  context  may  value  the  maintenance  of  clean  sediment  to 
enable  continued  clam  and  flatfish  harvest.  A  scientific  context  may  want  to  mechanisti¬ 
cally  understand  bioaccumulation  of  the  by-product  from  contaminated  sediment  and  its 
effects  on  the  clam  or  flatfish  fishery.  A  mutually  acceptable  assessment  endpoint  may  be 
stated  as  follows:  “not  more  than  a  50%  reduction  in  5%  or  less  of  the  benthic  species  with 
no  significant  bioaccumulation  of  the  compound  of  concern  in  fishery  organisms.” 

Several  potential  measurement  activities  will  flow  from  this  assessment  endpoint:  1) 
determination  of  exposure,  2)  performance  of  acute  and  chronic  toxicity  tests  {e.g.,  effects 
on  survival,  growth,  reproduction)  using  actual  or  surrogate  species  that  live  on  or  near 
the  sediment,  3)  TIE  which  establishes  causation,  and  4)  comparison  of  potentially  im¬ 
pacted  benthic  communities  to  reference  communities  in  field  studies.  Such  measure¬ 
ments  may  be  comprehensive  and  valuable  in  a  weight-of-evidence  approach,  but  it  is 
difficult  to  determine  their  ecological  relevance  and  whether  achieving  an  endpoint  will 
prevent  future  permanent  ecological  damage.  More  research  effort  is  required  in  deter¬ 
mining  the  ecological  relevance  of  individual  measurements  before  we  can  reach  a  full 
understanding  of  risk. 

Four  areas  of  knowledge  are  required  in  the  understanding  of  ecological  relevance  as  they 
apply  to  an  SERA.  These  areas  depend  on  the  convergence  of  administrative  aims,  soci¬ 
etal  values,  and  scientific  perspectives,  and  are  as  follows:  1)  ability  to  predict  and  detect 
change  in  response  variables;  2)  understanding  change  in  response  variables;  3)  causa¬ 
tion  {i.e.,  linking  cause  and  effect);  and  4)  inference.  How  and  why  these  knowledge  areas 
influence  SERAs  will  be  discussed  in  the  following  subsections. 

Finally,  it  should  be  noted  that  two  issues  —  spatial  and  temporal  scales  and  information 
on  the  extent  of  exposure  —  are  germane  to  each  of  the  knowledge  areas.  These  issues  are 
integrated  into  each  of  the  four  knowledge  areas  discussed  below. 

12.2  Predicting  and  detecting  change 

The  assessment  of  ecological  risk  to  aquatic  biota  from  sediments  contaminated  with 
toxicants  requires  the  ability  to  predict  and  detect  toxicity.  Such  predictions  may  be 
based  on  simple  inferences  or  increasingly  more  complex  correlations  using  data  from  a 
variety  of  sources  such  as  1)  biomarkers  of  exposure,  2)  laboratory-based  toxicity  tests 
with  single  or  multiple  species,  3)  QSARs  between  chemical  structure  and  toxicity,  4) 
artificially  enclosed  multispecies  experiments  in  the  laboratory  or  under  semi-realistic 
field  conditions  (mesocosms),  5)  biological  surveys  conducted  at  field  sites,  or  6)  complex 
mathematical  models  of  ecological  structure  based  on  field-collected  data  from  clean  and 
contaminated  sites.  Each  of  these  methods  has  utility  under  selected  circumstances;  each 
is  an  abstraction  of  reality  and  has  advantages  and  disadvantages  in  terms  of  ecological 
relevance. 
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12.2.1  Biological  models 

In  each  of  the  model  systems  presented  below,  biological  models  (cellular  or  subcellular 
biomarkers,  responses  to  toxicity  by  whole  organisms,  responses  to  toxicity  by  popula¬ 
tions  and  communities,  etc.)  are  used  to  predict  toxicity  and  its  effects  on  the  structure 
and  function  of  ecological  systems.  Some  models  are  more  successful  than  others  in  de¬ 
termining  causality,  especially  at  the  mechanistic  level,  but  may  be  less  ecologically  rel¬ 
evant.  Other  biological  models  can  provide  information  about  toxicity  without  providing 
an  understanding  of  the  actual  stressor-target  relationship.  For  example,  mortality  can 
be  observed  and  impacts  upon  ecological  systems  can  be  evaluated  without  knowing  the 
specifics  of  the  interactions.  Models  that  incorporate  more  than  one  species  or  attempt 
to  model  food  webs  may  be  quite  ecologically  relevant  because  both  direct  and  indirect 
effects  on  communities  and  their  interactions  may  be  observed.  In  addition,  biological 
models  can  serve  to  test  the  physical  and  conceptual  models  discussed  later  in  this  sec¬ 
tion. 

Molecular  biomarkers  or  bioindicators  (i.e.,  biochemical,  histological,  and  physiological 
indicators  of  anthropogenic  exposure)  generally  examine  the  mechanistic  aspects  of  a 
toxicant’s  effect  on  an  organism.  Several  biomarkers  have  been  developed  for  aquatic 
biota.  Examples  include  measures  of  the  induction  of  hepatic  P450  microsomal  cyto¬ 
chromes,  inhibition  of  brain  or  blood  cholinesterase  by  anticholinesterase  pesticides, 
aberrations  of  hemoglobin  synthesis,  chromosomal  damage,  etc.  (Huggett  et  al.  1992). 
Molecular  predictors  may  be  as  specific  as  CYPlAl,  a  P450  specifically  induced  by  chlo¬ 
rinated  organics  such  as  PCBs,  or  as  nonspecific  as  heat  shock  proteins.  These  types  of 
predictors  are  relevant  to  measuring  the  status  of  individual  organisms  and  to  indicating 
exposure  to  specific  or  nonspecific  contaminants.  Extrapolation  to  the  subsequent  effects 
of  contaminants  on  survival,  growth,  and  reproduction  within  a  population  based  on 
cellular  or  subcellular  changes  within  individuals  is  often  lacking.  As  a  result,  the  ability 
of  biomarkers  and  bioindicators  to  predict  alterations  in  the  structure  or  function  of  eco¬ 
systems  is  unknown.  Few  such  indicators  have  been  extensively  field-validated;  nor  have 
many  been  developed  for  organisms  that  inhabit  the  benthos  (Johnson  et  al.  1993). 

Single-species  toxicity  tests  have  long  been  used  to  evaluate  potential  toxicological  effects 
of  substances  found  contaminating  water  (Adams  1995)  or  sediment  (Burton  and 
MacPherson  1995).  These  tests  may  range  from  rapid  screening  procedures  such  as  the 
5-min  solid-phase  microbial  bioluminescence  test  (Microtox)  to  the  10-d  acute  amphipod 
test  for  survival  using  Rhepoxynius  abronius  (marine)  or  Hyalella  azteca  (freshwater)  to 
more  extensive  partial  or  complete  life  cycle  sublethal  tests  that  use  mortality,  growth,  or 
reproduction  as  endpoints.  Toxicity  tests  with  growth  and  reproduction  as  endpoints  are 
sublethal  estimates  of  the  toxicological  impacts  and  can  measure  effects  that  occur  at 
lower  concentrations  than  acute  mortality.  These  tests  are  also  available  for  a  wide  range 
of  organisms  from  different  trophic  levels  including  macrophytes  (rarely),  benthic  inver¬ 
tebrates  (often)  and  fish  (often).  Standard  methods  are  generally  available  that  allow 
some  comparability  between  laboratories  and  acceptable  QA/QC  (DeWitt  et  al.  1992; 
USEPA 1994;  ASTM 1995)  but  do  not  reflect  the  wide  range  of  conditions  found  in  the 
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environment.  Most  of  these  tests  are  based  upon  the  calculation  of  a  medium  effect  or 
lethal  concentration  or  dose  (LD50,  EC50, 1C50,  etc.)  when  sediments  are  spiked  with 
individual  contaminants,  or  upon  the  magnitude  of  response  for  field- collected  sedi¬ 
ments.  However,  the  most  useful  information  for  predictive  purposes  is  an  estimation  of 
the  dose-  (concentration-)  response  curve  and  the  subsequent  ranking  of  toxicity.  Often 
several  endpoints  are  measured  in  one  test  that  may  yield  several  dose-response  relation¬ 
ships.  A  variety  of  techniques  are  used  to  calculate  these  endpoints:  probit,  moving  angle 
average,  and  Spearman-Karber,  as  examples.  Most  of  these  methods  allow  calculation  of 
a  confidence  interval. 

An  NOEC  can  also  be  calculated  to  estimate  portions  of  the  dose-response  curve  that  are 
not  expected  to  result  in  a  toxicological  response.  These  NOEC  calculations  depend  upon 
the  number  of  replicates  used  in  the  toxicity  test  and  the  power  of  the  ANOVA  and  other 
multiple  comparison  techniques  used.  In  other  words,  a  particular  calculation  is  largely 
a  statistical  artifact  rather  than  an  absolute  measure.  A  more  realistic  computation  for 
predictive  use  is  the  ECS  or  EClO,  which  can  be  estimated  from  a  regression  of  the  doses 
against  responses. 

Predictions  of  toxicity  to  biota  living  in  contaminated  sediments,  based  on  single-species 
toxicity  tests  conducted  under  laboratory  conditions,  suffers  from  a  number  of  disadvan¬ 
tages  that  have  been  reviewed  extensively  by  Cairns  (1983),  Cairns  et  al  (1992),  and 
Landis  and  Ho-Yu  (1994).  These  include  1)  homogeneity  of  test  organisms  in  comparison 
to  natural  populations,  2)  health  of  test  species,  3)  exposure  conditions  that  may  not 
mimic  what  occurs  in  nature,  and  4)  inability  to  evaluate  indirect  or  secondary  impacts 
(Hurlbert  1975). 

There  are  also  difficulties  in  determining  just  what  the  ecological  relevance  of  any  calcu¬ 
lated  dose-response  endpoint  really  means  to  the  integrity  of  the  population  under  field 
conditions.  For  example,  how  does  a  50%  mortality  rate  for  organisms  exposed  to  sedi¬ 
ment  under  controlled  laboratory  conditions  extrapolate  to  detrimental  effects  on 
benthic  populations  exposed  to  predators  and  competitors  under  field  conditions  and 
suffering  from  natural  mortality?  What  level  of  mortality  actually  results  in  the  elimina¬ 
tion  or  extinction  of  a  population  of  organisms,  and  how  long  will  it  take  at  that  level?  Of 
greater  difficulty  is  the  interpretation  of  sublethal  endpoints  such  as  a  reduction  in 
growth  or  in  reproductive  output.  Does  a  10%  or  20%  reduction  in  biomass  or  growth  of 
a  cohort  of  organisms  exposed  under  laboratory  conditions  allow  a  prediction  of  effects 
on  a  population  that  can  no  longer  sustain  itself? 

Another  difficulty  of  the  traditional  toxicity  test  is  that  the  concentration  of  any  given 
toxicant  in  the  environmental  matrix  is  taken  as  a  representation  of  dose.  However,  the 
route  as  well  as  the  rate  of  uptake  of  many  compounds  may  differ  among  organisms,  and 
many  aspects  of  metabolism  may  alter  the  effective  or  target  dose  before  it  reaches  the 
site  of  action.  McCarty  (1991),  McCarty  et  al.  (1992),  and  McCarty  and  Mackay  (1993) 
have  suggested  that  a  measurement  of  the  toxic  potential  of  a  compound  through  im¬ 
provements  in  1)  our  ability  to  model  and  predict  the  fate  of  chemicals  in  aquatic  sedi- 
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merits,  2)  the  use  of  these  data  to  estimate  the  accumulation  of  chemical  residues  in  or¬ 
ganisms  and  in  assemblies  of  organisms  in  a  food  chain  or  web,  and  3)  the  ability  to  re¬ 
late  these  body  or  tissue  residues  to  various  acute  and  chronic  effects  as  determined  in 
toxicity  tests  and  bioassays  have  enhanced  the  assessment  of  ecological  risk  from  chemi¬ 
cals.  Such  links  between  critical  body  residue  (CBR)  and  adverse  biological  responses 
would  also  help  in  the  evaluation  of  toxic  mixtures  and  the  relative  contributions  of  a 
chemical  to  cumulative  toxicity. 

A  number  of  other  major  problems  specific  to  whole-sediment  toxicity  tests  in  addition 
to  those  discussed  above  are  1)  sediments  used  as  reference  or  control  sediments  may 
differ  in  their  physical/chemical  characteristics  from  those  used  in  treatments,  2)  organ¬ 
isms  are  exposed  to  sediments  that  have  been  altered  from  their  field  condition  during 
collection,  and  3)  organisms  are  usually  only  exposed  to  sediment  with  one  grain  size 
distribution  and  organic  matter  content,  especially  in  spiked-sediment  toxicity  tests  with 
single  contaminants.  When  such  toxicity  tests  are  used  to  predict  effects  under  natural 
conditions,  ecological  relevance  may  be  questionable.  Having  stated  this,  however,  there 
have  been  some  validations  which  correlate  community  responses  of  the  benthos  with 
toxicity  demonstrated  in  whole-sediment  toxicity  tests  (Canfield  et  al  1994;  Kemble  et  al 
1994;  Swartz  et  al.  1994;  Day  et  al.  1995;  Section  12.3.3;  Chapter  18). 

A  wide  variety  of  multispecies  toxicity  tests  can  be  used  to  predict  toxicity  (reviewed  by 
Kennedy  etal  1995).  Systems  that  contain  a  benthic  component  are  particularly  relevant 
to  sediment  work.  Artificial  streams  of  various  sizes  have  been  used  to  predict  toxicity  in 
the  field,  but  they  usually  require  a  riffle-pool  structure  where  fine  particulate  material 
can  be  deposited  to  be  relevant  to  the  issues  of  contaminated  sediments.  Multispecies 
toxicity  tests  have  some  advantages  as  models  of  natural  ecological  structures,  but  like 
natural  systems,  these  types  of  systems  exhibit  complex  dynamics  and  are  largely  irre¬ 
versible.  A  variety  of  endpoints  as  well  as  biodegradation,  bioaccumulation,  and  the  di¬ 
rect  and  indirect  effects  of  a  toxicant  upon  the  model  ecological  structure  can  be 
observed,  which  helps  in  the  predictive  process. 

Extrapolation  from  these  model  systems  to  the  ecological  structure  of  concern  has  been 
an  area  of  some  difficulty.  Multispecies  toxicity  tests  are  unlike  natural  ecological  struc¬ 
tures  in  several  important  ways.  First,  the  temporal  and  spatial  heterogeneity  of  the  ex¬ 
perimental  system  is  minimized  to  allow  easier  statistical  evaluation  of  the  dataset. 
Temperature,  chemical  composition,  and  the  composition  of  the  sediment  are  usually 
narrowly  defined.  Second,  the  species  composition  is  strictly  defined;  even  systems  that 
rely  upon  natural  innocula  often  specify  particular  species  or  functional  groups  as  desir¬ 
able  components.  Third,  migration  into  the  model  systems  is  usually  tightly  controlled, 
although  exceptions  exist  in  model  ecological  systems  that  have  been  used  to  study 
metapopulation  dynamics  (Drake  et  al  1993). 

Evaluation  of  the  data  is  a  common  technical  difficulty  with  multispecies  methods.  Sev¬ 
eral  techniques  have  been  used,  especially  conventional  univariate  ANOVA.  Regression 
methods  to  obtain  a  dose-response  relationship  have  also  been  used  (Liber  et  al  1992). 
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Derived  endpoints  are  often  used  such  as  species  diversity,  species  number,  species  rich¬ 
ness,  and  integrity  measurements  (Davis  1995).  These  derived  variables  are  supposedly 
simple  to  understand  and  have  low  variance,  but  they  are  simplifications  that  lose  the 
true  variability  inherent  within  the  system.  Many  of  these  derived  variables  are  also  based 
upon  assumptions  from  information  theory  or  other  theoretical  mathematical  models  so 
the  resultant  measure  is  an  information-losing  projection  based  upon  a  transformation, 
which  in  turn  is  based  upon  a  hypothesis  about  the  workings  of  ecological  systems.  A 
variety  of  multivariate  methods  may  provide  a  more  accurate  representation  of  ecologi¬ 
cal  structure. 

12.2.2  Quantitative  sfructure-acHvity  relationships 

Quantitative  structure-activity  relationship  models  are  mathematical  equations  derived 
to  estimate  the  toxicity  or  other  property  of  a  chemical  from  its  structure.  Quantitative 
structure-activity  relationships  are  usually  applied  to  predict  the  toxicity  of  new  chemi¬ 
cals  based  on  similarities  in  structure  or  substructure  of  a  molecule  to  those  of  other 
chemicals  with  known  toxicity.  Two  basic  sets  of  models  exist.  In  cases  when  the  structure 
of  the  receptor  is  well  known,  it  is  possible  to  use  molecular  modeling  to  examine  the 
interactions  such  as  hydrophobicity  or  hydrogen  bonding  between  a  receptor  and  the 
ligand.  Although  data  of  such  detail  exist  only  in  a  few  instances,  these  models  have 
proven  especially  useful  in  examining  the  effects  of  PCBs  and  hormone  mimics 
(McKinney  and  Waller  1994). 

Generally,  QSAR  models  are  statistical,  using  substructural  keys  and  correspondence  to 
toxicity  data  to  generate  predictions;  multivariate  linear  regression  and  discriminate 
analysis  are  commonly  used.  One  of  the  primary  difficulties  in  generating  QSARs  is  the 
lack  of  a  detailed  set  of  toxicity  tests  using  a  wide  variety  of  chemical  types.  Often  only  a 
few  chemical  structures  are  tested  using  acute  or  chronic  tests,  producing  large  gaps  in 
the  map  of  chemical  structure  versus  toxicological  response.  QSARs  rely  on  single-species 
toxicity  tests  to  provide  the  database;  therefore,  they  suffer  from  the  limitations  to  pre¬ 
dicting  ecological  responses  discussed  above.  Many  of  the  acute  or  chronic  data  used  in 
QSARs  suffer  from  lack  of  repeatability,  and  two  chemicals  may  have  a  very  different  tox¬ 
icity  value.  Another  limitation  is  the  difficulty  of  accurately  representing  the  structural 
aspects  of  the  chemical  that  contribute  to  the  toxicological  response.  Another  long  stand¬ 
ing  difficulty  has  been  the  inability  of  many  programs  to  recognize  sufficient  substruc¬ 
tural  keys,  relying  instead  upon  other  characteristics  such  as  the  log  of  the  octanol-water 
partition  coefficient.  In  spite  of  these  difficulties,  regression-type  QSAR  models  have  been 
developed  for  a  number  of  endpoints.  In  environmental  toxicology,  include  daphnid  and 
fish  toxicity  as  well  as  interspecies  models  (Enslein  etal  1987, 1989). 

12.2.3  Stochastic  and  deterministic  models 

These  types  of  predictive  tools  are  generally  derived  from  empirical  relationships  or  from 
first  principles.  Taylor  s  Power  Law  (Taylor  1961;  Elliott  1971)  is  an  empirical  relationship 
predicting  a  measure  of  contagion  from  the  number  of  organisms  within  a  collection.  The 
relationship  was  derived  from  field  data  and  seems  to  be  consistent  for  a  species.  In  con- 
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trast,  the  interactions  and  dynamics  predicted  by  the  classical  Lotka-Volterra  models  for 
population  competition  are  not  directly  related  to  responses  found  in  the  field.  These 
models  are  simple  constructs  with  terms  such  as  K  (carrying  capacity)  and  a  (competition 
coefficient).  Other  types  of  population  models  such  as  the  island  biogeography  deriva¬ 
tions  of  MacArthur  and  Wilson  (1967)  are  also  of  this  general  type.  In  the  early  and  mid- 
1970s,  discrete  models  of  population  dynamics  were  generated  using  difference  instead 
of  differential  equations.  The  use  of  these  models,  even  simple  ones,  led  to  the  co-discov¬ 
ery  of  strange  attractors  and  chaotic  dynamics. 

Barnthouse  (1992)  and  Suter  (1993)  describe  the  uses  of  common  models  in  the  field  of 
risk  assessment.  The  models  used  in  fisheries  and  wildlife  management  to  predict  the  risk 
of  extinction  as  a  result  of  toxicant  addition  are  especially  useful.  These  are  typically  de¬ 
mographic  models  based  on  a  matrix  representation  of  growth  and  mortality  rates  for 
each  age  or  size  class  of  fish  species.  Other  models  include  probabilistic  terms  as  part  of 
the  equations.  Stochastic  differential  or  stochastic  difference  models  can  provide  as  out¬ 
put  a  probability  distribution  from  a  number  of  iterations.  Stochastic  difference  equa¬ 
tions  are  similar  except  that  they  are  discrete;  instead  of  a  smooth  distribution,  they  can 
produce  a  rather  dramatic  range  of  possible  outcomes. 

Understanding  the  intrinsic  dynamics  of  each  specific  case  is  a  difficulty  in  the  use  of 
models.  Nonlinear  difference  equations  are  perhaps  an  extreme  case,  having  realms  of 
stability,  bifurcations,  and  eventually  chaotic  dynamics  (May  and  Oster  1978).  The  tran¬ 
sition  to  each  of  these  states  is  abrupt  but  can  be  easily  mapped. 

Oreskes  et  al  (1994)  discuss  the  use  of  models  in  the  geophysical  sciences.  Their  review 
is  relevant  to  much  of  the  modeling  conducted  as  part  of  an  SERA  because  of  the  open 
nature  of  geophysical  systems.  Oreskes  etal  (1994)  propose  that  models  can  only  be  con¬ 
firmed  by  the  use  of  experimental  data  but  not  be  verified  or  validated.  Models  assist  in 
the  creation  and  falsification  of  hypotheses. 

12.2.4  Univariate  and  multivariate  methods  for  detecting  anthropogenic 
impact 

The  success  of  any  ERA,  whether  for  sediments  or  any  other  environmental  matrix,  lies 
in  the  ability  of  the  methods  used  to  predict  effects  in  impacted  areas  versus  unimpacted 
areas.  The  inherent  variability  of  natural  ecosystems  and  the  complications  that  natural 
stressors  can  have  on  the  benthos  make  this  a  difficult  task. 

A  number  of  approaches  have  been  used  to  identify  “normal”  communities  at  a  given  site 
(Chapter  10).  The  most  common  approach  tries  to  develop  relationships  between  envi¬ 
ronmental  and  community  attributes  based  on  the  structure  of  the  biotic  community. 
Community  attributes  are  usually  based  on  taxa  richness,  enumerations,  or  some  integra¬ 
tive  measure  of  the  two  {e.g.,  biotic,  diversity,  or  similarity  indices).  Empirical  approaches 
generally  are  either  univariate  or  multivariate  in  nature. 

Univariate  approaches  are  the  most  frequently  used  (Norris  and  Georges  1993)  and  can 
be  quantitative  (?>.,  hypothesis  setting,  rigorous  statistical  design,  replication;  Resh  and 
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McElravy  1993)  or  qualitative  {le,,  rapid  assessment  procedures;  Resh  and  Jackson  1993). 
These  approaches  usually  compare  reference  sites  to  impacted  sites,  but  only  incorporate 
environmental  attributes  as  determinants  of  community  structure  through  inference.  In 
addition,  the  majority  of  these  approaches  have  been  applied  only  in  riffle  areas,  not  in 
depositional  zones. 

Conventional  multivariate  approaches  can  be  divided  into  two  types:  1)  multiple  regres¬ 
sion  and  2)  paired  matrices.  Multiple  regression  develops  relationships  between  benthic 
invertebrates  and  environmental  attributes,  but  it  only  considers  univariate  measures  of 
community  structure.  The  paired  matrices  approach  tries  to  identify  pattern  and  struc¬ 
ture  in  the  benthic  invertebrate  community  and  relate  that  to  environmental  variables 
{e.g.,  Wright  et  al  1984, 1988;  Corkum  and  Currie  1987;  Corkum  1989;  Johnson  and 
Wiederholm  1989;  Reynoldson  et  al  1995).  The  purpose  is  to  establish  predictive  mod¬ 
els  of  communities  that  can  be  used  to  assess  environmental  impairment.  If  the  predic¬ 
tive  models  are  based  on  reference  conditions,  then  the  models  derived  can  be  used  to 
measure  divergence  from  the  reference  state  and  the  degree  of  risk  associated  with  an 
environmental  stressor  (for  an  example  in  SERA,  see  Reynoldson  et  al  1995). 

In  a  different  multivariate  approach,  data  clusters  are  optimized  based  on  their  intended 
function  {i.e.,  the  accurate  prediction  of  properties  of  the  data)  rather  than  using  a  dis¬ 
tance  metric  or  similarity  function.  The  clustering  method  is  applicable,  without  further 
ad  hoc  assumptions  or  transformations  of  the  data,  as  follows:  1)  when  features  are  het¬ 
erogeneous  (both  discrete  and  continuous)  and  not  combinable,  2)  where  some  data 
points  have  missing  feature  values,  and  3)  where  some  features  are  irrelevant  (/>.,  have 
large  variance  but  little  correlation  with  other  features).  Further,  it  provides  an  integral 
measure  of  the  quality  of  the  resulting  clustering.  In  order  to  compute  these  relation¬ 
ships,  the  RIFFLE  clustering  program  is  used  (Matthews  and  Hearne  1991).  A  variety  of 
field  (Matthews,  Matthews,  and  Ehinger  1991;  Matthews,  Matthews,  and  Hachmoller 
1991)  and  microcosm  experiments  (Landis  et  al  1989,  1994;  Landis,  Matthews, 
Markiewicz,  and  Matthews  1993;  Landis,  Matthews,  Markiewicz,  Shough,  and  Matthews 
1993,  Matthews  et  al  1995)  show  that  the  conceptual  clustering  is,  in  many  respects, 
superior  to  traditional  methods. 

Identification  of  patterns  and  their  relationship  to  treatments  or  other  environmental 
descriptors  is  only  the  initial  step.  Visualization  of  the  relationships  and  the  resultant 
dynamics  can  aid  in  the  interpretation  of  the  data.  WORM  is  a  visualization  tool  used  for 
investigating  high-dimensional  time  series  from  ecotoxicological  dose-response  experi¬ 
ments  involving  treatment  groups.  It  allows  the  considerable  pattern  recognition  abilities 
of  humans  to  focus  on  the  task  of  investigating  complex  data.  Visualization  does  not  re¬ 
place  statistical  methods  but  rather  helps  researchers  to  gain  qualitative  insights  into  the 
data  and  to  form  hypotheses  that  can  then  be  investigated  with  formal  quantitative  meth¬ 
ods.  The  tool  uses  two-  and  three-dimensional  plots  and  animation  to  visually  compare 
variance  between  treatment  groups  and  between  the  replicates  within  each  individual 
treatment  group  over  the  course  of  the  experiment. 
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An  AI  program  called  RIGGLE  has  also  been  designed  to  perform  a  conceptual  temporal 
analysis  of  hypervariate  datasets  by  investigating  the  intervals  between  the  data  points 
rather  than  the  data  points  themselves.  A  set  theory  based  on  similarity  function  is  then 
employed  to  compare  replicates  of  an  experiment  or  of  different  experiments  and  to  iden¬ 
tify  population  dynamics  that  are  similar  but  may  not  be  occurring  in  exactly  the  same 
time  frame.  RIGGLE  has  been  applied  to  SAM  data  with  promising  results  (Landis, 
Matthews,  Markiewicz,  and  Matthews  1993;  Landis,  Matthews,  Markiewicz,  Shough, 
and  Matthews  1993;  Landis  et  al.  1994). 


12.3  Understanding  change  in  sediment  ecological  risk 
assessments 


12.3.1  Introduction 

Any  anthropogenic  impact  added  to  an  aquatic  ecosystem  is  layered  onto  the  natural 
patterns  of  community  structure  and  function  that  result  from  biotic  and  abiotic  factors 
acting  at  several  spatial  and  temporal  scales.  Therefore,  natural  variation  in  the  structure 
and  function  of  the  benthos  in  aquatic  ecosystems  must  be  separated  from  change  initi¬ 
ated  and  sustained  by  toxic  sediments  before  the  interpretation  and  ecological  relevance 
of  the  impact  can  be  understood.  This  requires  an  understanding  of  the  ecosystem  and  its 
trajectory  (change  over  time  from  natural  causes). 


12.3.2  Spatial  and  temporal  variability  of  sediment  ecosystems 

Biotic  communities  in  sediments  appear  to  consist  of  mosaics  of  patches  characterized  by 
different  species  compositions.  These  patches  are  believed  to  represent  different  stages  of 
recovery  from  a  range  of  external  physical  and  biological  disturbances  (reviewed  by  Hall 
et  al  1994)  although  the  relative  importance  of  the  various  forces  that  structure  the 
benthic  environment  remain  poorly  known  (Power  et  al  1988;  Posey  1990;  Hall  et  al 
1994).  Patterns  of  recovery  from  these  kinds  of  disturbances  are  generally  unpredictable 
because  of  variability  in  recolonization  caused  by  immigration  and  larval  recruitment. 
These  disturbances  may  affect  a  variety  of  spatial  scales,  making  the  interpretation  of 
anthropogenic  insult  difficult. 

Recolonization  and  succession  (z>.,  changes  in  assemblages  following  disturbance),  de¬ 
pend  on  the  supply  of  propagules  of  different  species.  This  supply  is  subject  to  spatial  and 
temporal  variability  such  as  the  timing  of  the  reproductive  cycle  (Whitlatch  and  Zajac 
1985)  and  patterns  of  drift.  For  example,  many  muddy-sandy  intertidal  areas  of 
Manukau  Harbor,  New  Zealand,  are  dominated  by  the  bivalves  Macomona  Uliana  and 
Chione  stutchburyi.  These  assemblages  are  subject  to  frequent  disturbance  by  wind-gen¬ 
erated  waves  (Turner  et  al  1995).  Recolonization  of  the  disturbed  areas  is  at  least  partly 
dependent  on  the  settlement  of  juvenile  bivalves.  This  natural  disturbance  and  recolo¬ 
nization  provides  heterogeneity  in  the  spatial  and  temporal  scales  that  must  be  recog¬ 
nized  before  the  impact  of  anthropogenic  insult  can  be  properly  interpreted. 

Estimation  of  normal  patterns  of  spatial  and  temporal  variation  is  a  function  of  the  pre¬ 
dicted  scale  and  organizational  level  of  the  insult.  Structural  responses  to  insult  may 
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appear  at  levels  of  organization  from  suborganismal  to  ecosystem,  including  the  organ¬ 
ism,  population,  and  community.  Functional  responses  are  likely  to  show  similar  hierar¬ 
chical  organization  from  physiological  to  population  (reproduction,  turnover)  to 
community  (primary  production,  decomposition,  respiration,  species  interactions)  lev¬ 
els.  Each  of  these  levels  of  organization  is  characterized  by  temporal  magnitude  ranging 
from  physiological  cycles  (reproduction,  respiration,  lifetime),  diurnal  cycles,  lunar 
cycles,  to  seasonal  cycles.  These  normal  functional  interactions  of  levels  of  organization 
and  time  may  be  overshadowed  by  the  presence  of  stochastic  events.  In  fact,  the  major 
structuring  pressures  in  natural  systems  may  be  dictated  by  rare  stochastic  events 
(Underwood  and  Denley  1984), 

As  an  example  of  the  potential  interactions  between  natural  spatial  variation  and  the  ef¬ 
fects  of  contaminants,  macrofaunal  populations,  and  communities  in  sandy  marine  sedi¬ 
ment  in  Botany  Bay,  Australia,  were  exposed  to  experimental  addition  of  copper  to  the 
sediment  (Morrisey  et  al  1995  and  in  review).  The  resulting  responses  were  not  spatially 
consistent;  they  varied  at  scales  of  meters  or  hundreds  of  meters.  This  variation  in  re¬ 
sponses  corresponded  to  patchiness  in  the  distribution  of  the  animals,  which  existed  at 
the  start  of  the  experiment.  The  patchiness  was,  presumably,  a  consequence  of  previous 
physical  and  biological  interactions. 

Bottom-up  {e,g.,  nutrient  availability;  Robbins  etal  1989)  or  top-down  {e.g,  predation 
and  competition)  controls  on  populations  may  determine  the  quantity  and  species  com¬ 
position  of  benthic  communities  in  the  absence  of  other  perturbations  (Hunter  and  Price 
1992).  The  relative  importance  of  all  of  these  structuring  forces  varies  through  time  and 
can  be  tied  to  normal  temporal  cycles.  For  example,  blooms  of  phytoplankton  may  re¬ 
duce  the  intensity  of  competitive  interactions  among  consumers. 

Ecosystems  are  constantly  moving  from  one  state  to  another  as  a  result  of  the  continual 
interaction  of  natural  perturbations  and  temporal  cycles  within  a  system.  These  succes- 
sional  changes  can  be  characterized  as  a  trajectory  of  change  for  a  given  system,  {e.g, 
change  toward  reconstruction  after  acute  perturbation).  The  trajectory  of  system  change 
will  be  a  strong  indicator  of  overall  system  function.  In  the  case  of  Manukau  Harbour  and 
the  bivalves,  the  ability  to  demonstrate  the  trajectory  of  population  recolonization  and 
succession  would  provide  an  assessment  endpoint  against  which  ecological  relevance 
could  be  interpreted.  Determination  of  such  trajectories  for  populations  requires  large 
studies  and  measurement  of  many  variables  for  a  large  number  of  reference  sites  against 
which  to  compare  the  test  site.  These  trajectories  may  be  visualized  and  measured  in  sev¬ 
eral  ways,  including  time  series  and  trend  analyses  {e.g,  Gottman  1981).  Two  recently 
developed  conceptual  clustering  techniques,  WORM  and  RIGGLE,  have  been  described 
above. 

The  Beyond  BACl  (before-after/ control-impact)  designs  for  detecting  environmental 
impacts  (Underwood  1993)  recognize  that  temporal  trajectories  in  populations  of  organ¬ 
isms  at  different  places  need  not  be  the  same  even  in  the  absence  of  an  impact.  This  is 
achieved  by  comparing  differences  in  changes  in  trajectories  before  and  after  the  putative 
impact  between  the  impacted  site  and  several  control  sites  with  differences  in  changes 
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among  the  control  sites.  The  differences  in  changes  among  control  sites  provides  an  es¬ 
timate  of  natural  variation  in  trajectories  among  places  in  the  absence  of  an  impact. 

12.3.3  Effects  of  contaminants  on  sediment  ecosystems 

Assessment  of  the  effects  of  contaminants  presumes  exposure;  thus,  to  develop  under¬ 
standing  of  the  ecological  relevance  of  the  measurement  endpoint,  the  exposure  condi¬ 
tion  producing  the  observed  change  needs  to  be  determined.  Direct  contaminant  effects 
are  a  combination  of  the  exposure  environment  and  duration  and  the  sensitivity  of  the 
organism.  Exposure  of  biota  to  contaminants  in  sediment  is  complicated  by  chemical 
interactions  among  the  particles  of  sediment,  the  pore  water,  the  overlying  water,  and  the 
contaminant  as  well  as  the  biological  behavior  of  the  organism.  Benthic  organisms  may 
be  exposed  to  the  contaminant  either  by  passive  diffusion  of  the  compound  through  the 
integument  from  the  pore  or  overlying  water  and  ingestion  of  particles  of  sediment 
coated  with  the  contaminant.  Any  determination  of  exposure,  as  well  as  the  interpreta¬ 
tion  of  effects,  is  facilitated  by  knowledge  of  more  than  the  external  concentration  as 
dose.  Significant  examples  are  available  that  demonstrate  that  the  external  dose  may  not 
be  sufficient  for  describing  exposure  to  benthic  organisms  (see  below).  Thus,  interpreta¬ 
tion  of  effects  at  varying  hierarchies  may  be  more  accurately  described  by  assessing  con¬ 
centrations  within  the  community  rather  than  individual  organisms.  This  measurement 
of  accumulation  represents  an  integration  of  the  toxicokinetics  of  exposure  for  a  species 
(but  see  Chapter  18).  Assessment  of  the  exposure  that  results  in  a  direct  effect  (the  ulti¬ 
mate  cause  of  any  observed  ecological  effects)  needs  to  be  delineated  and  the  causative 
interaction  defined;  interpretation  of  indirect  effects  will  be  more  complicated. 

Direct  ^ects  are  those  where  the  stressor  acts  on  the  ecological  component  of  interest.  In¬ 
direct  effects  are  those  where  the  stressor  acts  upon  the  component  of  interest  through 
supporting  components  of  the  ecosystem.  Direct  effects  at  the  individual  level  include 
mortality  and  changes  in  growth  and  behavior;  at  the  population  level,  they  might  in¬ 
volve  reproduction  or  demographic  structure;  and  at  the  community  level,  they  could 
involve  changes  in  diversity  and  species  dominance.  Indirect  effects  at  the  individual  level 
might  be  changes  in  growth  of  a  predator  because  of  food  limitation  manifested  when  a 
species  of  prey  is  removed  through  direct  mortality.  For  example,  Pratt  et  al  (1993)  ob¬ 
served  a  reduction  in  the  availability  of  food  for  deposit-  and  detritus-feeders  when  mi¬ 
crobial  populations  in  sediments  were  affected  by  contaminants. 

The  importance  of  species  interactions  {e.g.,  predation  and  competition)  in  structuring 
benthic  communities  has  received  considerable  attention  in  freshwater  and  marine  ecol¬ 
ogy  (review  of  rocky  intertidal  communities:  Dayton  1994;  reviews  of  freshwater  commu¬ 
nities:  McAulifife  1984;  Peckarsky  1984;  Sih  etal  1985;  Power  etal.  1988;  Power  1990). 
Understanding  how  species  interactions  are  affected  by  contaminants  has  been  largely 
ignored  probably  because  most  of  the  concerns  about  the  effects  of  contaminants  in  soft- 
substrate  environments  have  proved  much  less  tractable  to  experimental  investigation 
than  those  on  hard  substrata  described  in  the  references  cited  above.  Studies  of 
macrofaunal  communities  in  marine  sediments,  for  example,  have  so  far  produced  am- 


SETAC  Press 


178 


ECOLOGICAL  RISK  ASSESSMENT  OF  CONTAMINATED  SEDIMENTS 


bivalent  results,  and  there  are  no  general  models  of  how  these  communities  are  struc¬ 
tured  (e.g.,  Posey  1990;  Hall  et  al  1994). 

The  effects  of  contaminants  on  species  interactions  will  likely  depend  on  the  magnitude 
of  the  stress  and  the  relative  sensitivities  of  the  interacting  species.  Menge  and 
Sutherland  (1987)  developed  a  model  of  community  regulation  showing  that  the  relative 
importance  of  species  interactions  decreases  with  increased  environmental  stress.  If  en¬ 
vironmental  contaminants  are  analogous  to  environmental  stress,  then  similar  responses 
may  occur  along  contaminant  stress  gradients.  Changes  in  the  number  of  species  present 
in  successively  higher  trophic  levels  as  a  consequence  of  the  alteration  of  interspecific 
interactions  in  lower  levels  by  contaminants  have  been  modeled  by  Landis  etal  (1989). 
Warner  et  al  (1993)  found  that  the  outcome  of  interspecific  competition  between 
anurans  was  changed  by  acidification.  Wipfli  and  Merritt  (1994)  observed  that  reduced 
black  fly  density  in  a  larvicide-treated  stream  altered  species  interactions  and  indirectly 
affected  community  structure. 

The  specific  effects  of  contaminants  differ  depending  on  whether  consumers  or  their  prey 
are  more  susceptible  to  the  stress  (Menge  and  Olson  1990).  For  example,  if  predators  are 
more  sensitive  to  stress  than  their  prey,  the  consumer-stress  model  predicts  that  preda¬ 
tion  rates  will  be  lower  in  high-stress  environments.  This  pattern  was  observed  in  rocky 
intertidal  habitats  subjected  to  extreme  wave  action  (Menge  and  Olson  1990),  in  high 
gradient  streams  (Peckarsky  et  al  1990),  and  in  acidified  lakes  (Locke  and  Sprules  1994). 
In  contrast,  if  prey  are  more  sensitive  to  stress,  the  prey-stress  model  predicts  that  preda¬ 
tion  rates  will  be  greater  in  stressful  environments.  This  pattern  was  observed  in  the 
predator-prey  experiments  between  stoneflies  and  caddisflies  (Clements  etal  1989). 

These  authors  reported  that  susceptibility  of  net-spinning  caddisflies  to  stonefly  preda¬ 
tion  increased  in  streams  dosed  with  copper,  compared  to  control  streams.  Understand¬ 
ing  the  importance  of  indirect  effects  of  contaminants  may  help  reconcile  differences 
between  laboratory  and  field  responses.  For  example,  failure  to  show  direct  effects  in 
laboratory  toxicity  tests,  despite  altered  benthic  community  structure  observed  in  the 
field,  may  be  a  result  of  indirect  effects  or  the  benthos  is  responding  to  noncontaminant 
factors. 

The  lack  of  studies  on  indirect  effects  of  contaminants  (but  see  Hurlbert  1975)  is  prob¬ 
ably  related  to  the  difficulty  of  conducting  the  necessary  field  experiments  to  separate 
direct  effects  on  species  from  indirect  effects  on  species  interactions.  First,  it  must  be 
shown  that  species  interactions  are  important,  and  then  it  must  be  demonstrated  that 
these  interactions  are  affected  by  contaminants.  For  example,  caging  experiments  in  ref¬ 
erence  and  polluted  locations  could  be  used  to  evaluate  the  effects  of  predation  in  both 
situations.  Greater  stonefly  predation  on  benthic  invertebrate  communities  occurred  in 
metal-polluted  locations  than  reference  locations  as  measured  in  experimental  streams 
(Clements,  unpublished  data).  This  result  supports  the  hypothesis  that  contaminants 
increase  the  susceptibility  of  prey  organisms  to  predation.  The  same  experiment  could  be 
done  in  artificial  stream  mesocosms. 
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For  benthic  communities  structured  by  bottom-up  processes,  perturbations  caused  by 
either  natural  or  anthropogenic  stress  on  primary  production  will  be  observed  as  indirect 
effects  on  population  sizes  of  consumers  and  higher  trophic  levels.  Microbial  decompos¬ 
ers  can  be  similarly  affected  (Cairns  et  al  1992).  Effects  of  contaminants  in  sediments  in 
bottom-up  systems  are  more  likely  to  be  significant  in  assemblages  that  are  dominated  by 
detritus  and  deposit  feeders  than  when  filter  feeders  are  dominant  because  the  food  sup¬ 
ply  of  filter  feeders  is  less  likely  to  be  affected.  Posey  s  (1990)  review  of  the  success  of  func¬ 
tional  models  in  predicting  the  structure  of  communities  in  marine  sediments  suggests 
that  assemblages  in  which  both  trophic  groups  are  abundant  are  common  {cf.  the 
"trophic-group  amensalism”  hypothesis  proposed  by  Rhoads  and  Young  [1970])  to  ex¬ 
plain  the  structure  of  communities  in  marine  sediments).  Estuaries,  however,  provide  an 
exception  in  that  assemblages  are  often  dominated  by  deposit/detritus  feeders  and  their 
predators,  presumably  because  suspension  feeders  are  intolerant  of  the  large  loads  of 
suspended  material  occurring  in  these  environments.  Estuaries  are  particularly  prone  to 
the  accumulation  of  sediment-associated  contaminants  because  they  have  high  rates  of 
sedimentation  and  are  recipients  of  river-borne  contamination.  Thus,  estuaries  may  rep¬ 
resent  areas  where  contaminants  that  adversely  affect  the  lowest  trophic  levels  are  of 
greatest  ecological  significance,  assuming  that  bottom-up  factors  structure  the  commu¬ 
nity.  The  susceptibility  of  soft-sediment  communities  in  estuaries  to  natural  disturbances 
may  vary  geographically  and  may  be  large. 

Predators  may  be  able  to  influence  the  temporal  and  spatial  distribution,  species-density 
and  abundance  of  their  prey  (Ambrose  1991).  Thus,  in  top-down  systems,  control  of 
population  abundance  may  result  from  “trophic  cascades”  of  predator-prey  interactions. 
If  contaminants  affect  the  predator  more  than  the  prey,  the  prey  may  be  released  from 
trophic  control  and  bloom  as  an  indirect  effect  of  contamination.  For  example,  in  Norwe¬ 
gian  ^ords,  two  predatory  polychaetes,  Glycera  rouxii  and  Lumbrineris  sp.,  were  intolerant 
of  copper  contamination  (Rygg  1985).  Thus,  copper  contamination  could  release  their 
prey  from  predation  by  these  species,  assuming  that  the  prey  were  relatively  less  affected. 
However,  other  polychaete  groups  in  the  same  system,  Glycera  alba,  Goniada  maculata, 
and  Nephtys  spp.,  were  more  tolerant  of  copper,  so  that  if  one  species  of  predator  was 
affected,  another  could  simply  take  its  place.  Such  “functional  redundancy”  has  been 
cited  as  an  argument  against  the  use  of  the  structure  of  assemblages  as  a  reliable  surro¬ 
gate  for  the  protection  of  ecological  integrity  (Cairns  et  al.  1992).  Alternatively,  it  could  be 
argued  that  such  redundancy  makes  structural  changes  more  sensitive  indicators  of 
change  (Rapport  et  al  1985).  The  ecological  effects  are,  in  any  case,  likely  to  depend  on 
the  nature  of  the  stressor.  When  one  predator  species  is  substituted  for  another,  the  tra¬ 
jectory  of  the  system  will  be  affected  and  such  changes  in  trajectory  may  well  be  both 
sensitive  and  important  indicators  of  ecological  change. 

The  trajectory  of  a  community  through  time  will  likely  be  diverted  as  a  consequence  of 
these  various  effects  of  contaminants  at  different  scales  and  levels  of  organization.  For 
example,  the  recolonization  of  disturbed  areas  in  Manukau  Harbour  by  bivalves  (see 
above)  can  depend  not  only  on  spatial  and  temporal  variation  in  immigration  but  also  on 
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degree  of  contamination  in  the  environment.  Juveniles  of  the  bivalve  M.  Uliana  avoided 
laboratory  sediments  contaminated  with  copper  at  5  g.g'^  or  greater  (Roper  and  Hickey 
1994),  so  contamination  of  the  sediments  by  copper  could  retard  or  eliminate  the  poten¬ 
tial  for  recolonization  despite  the  availability  of  propagules.  This  would  disturb  the  nor¬ 
mal  trajectory  of  the  system.  However,  copper  concentrations  of  sediments  in  the  harbor 
are  in  the  range  of  20  to  30  g.g'^  and  populations  of  M.  Uliana  persist,  suggesting  that 
other  environmental  factors  or  bioavailability  issues  mitigate  the  inhibitory  effects  of 
copper  in  situ. 

The  persistence  of  M.  Uliana  populations  despite  high  copper  concentrations  in  the  sedi¬ 
ments  raises  the  problem  of  spatial  and  temporal  scale  in  studies  of  the  effects  of  con¬ 
taminants.  For  example,  if  the  structure  of  faunal  assemblages  is  reset  by  large-scale 
physical  disturbances  such  as  storms,  then  the  impact  of  contaminants  on  smaller-scale 
processes  such  as  competition  and  predation  may  be  relatively  unimportant.  In  reality, 
the  impact  of  contaminants  on  the  potentially  more  sensitive  larval  stages  may  dictate 
the  success  of  recolonization. 

12.3.4  Future  work 

Ecosystems  must  be  recognized  as  spatially  and  temporally  variable,  and  this  natural 
variability  and  how  it  relates  to  the  structure  of  benthic  communities  needs  to  be  estab¬ 
lished.  Information  is  also  needed  about  the  relative  variabilities  and  sensitivities  of 
benthic  communities  to  contaminants  under  different  environmental  conditions  and 
about  their  usefulness  as  measurement  endpoints  for  different  assessment  endpoints. 
Methods  should  be  developed  that  allow  determination  of  the  trajectory  of  natural  as¬ 
semblages  (communities)  in  light  of  recognized  temporal  and  spatial  structures  or  that 
allow  appropriate  comparisons  of  trajectories  among  reference  and  contaminated  sites 
(Underwood  1993).  Although  their  development  is  still  in  an  early  stage,  manipulative 
field  experiments  (Cooper  and  Barmuta  1993;  Hare  etal  1994;  Morrisey  etal  1995  and 
in  review)  offer  a  means  of  studying  the  effects  of  contaminants  on  structural  and  func¬ 
tional  variables  under  environmentally  realistic  conditions.  Manipulative  field  experi¬ 
ments  also  offer  a  high  potential  for  identification  of  cause  and  effect. 

12.4  Cause  and  effect 

12.4.1  Why  is  causation  important  to  sediment  ecological  risk  assessment? 

The  demonstration  of  causal  relationships  between  ecological  responses  and  anthropo¬ 
genic  stressors  is  important  in  SERAs  in  order  to  correctly  assign  an  underlying  reason 
for  change  in  an  ecological  measurement.  Expensive  remediation  and  litigation  decisions 
are  based  on  the  assumption  of  causality,  so  it  is  essential  that  SERAs  attempt  to  establish 
a  cause-and-effect  relationship  between  stressors  and  responses.  In  addition,  it  is  impor¬ 
tant  to  know  the  identity  of  the  stressor,  so  its  effects  can  be  mitigated. 

Direct  demonstration  of  causality  is  possible  only  when  all  relevant  factors  which  may 
mitigate  change  are  accounted  for  and  controlled.  Therefore,  in  risk  assessments  that  are 
based  on  traditional  toxicity  tests  using  spiked  sediment,  and  in  some  cases  of  modeling, 
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causality  is  not  in  doubt.  However,  predictions  about  the  nature  and  level  of  effects  under 
natural  conditions  may  be  doubtful  due  to  many  uncontrolled  but  mitigating  factors  (see 
Section  12.3.2  and  Chapter  10).  When  conditions  are  not  under  the  control  of  the  inves¬ 
tigator,  such  as  in  biological  surveys,  associations  with  toxicity  can  be  demonstrated  but 
causality  can  only  be  inferred.  In  such  cases,  causality  must  be  established  by  demon¬ 
strating  consistency  among  the  uncontrolled  variability  in  control  versus  impacted  areas. 
In  addition,  some  measurement  endpoints  for  uncontrolled  field  studies  can  be  useful  for 
inferring  causality  because  they  have  a  well-defined  relationship  to  standard  test  end¬ 
points.  Similarly,  some  test  endpoints  are  useful  for  demonstrating  causality  because  they 
have  well-defined  or  relatively  consistent  exposure-response  relationships  for  the  con¬ 
taminants  of  concern  or  because  they  have  well-defined  relationships  to  endpoint  effects 
in  the  field.  Demonstration  of  the  association  between  measurement  endpoints  from 
controlled  and  uncontrolled  studies  is  a  component  of  a  weight-of-evidence  analysis, 
which  also  considers  the  quality  of  the  various  lines  of  evidence.  The  analysis  of  consis¬ 
tency  is  typically  based  on  rules  such  as  Koch’s  (1966)  postulates  and  Hill’s  (1965)  crite¬ 
ria  but  may  be  statistical  or  use  problem-specific  inference  procedures  (Suter  1993). 

Every  measurement  endpoint  must  have  an  element  of  causality  to  link  the  response  to 
the  magnitude  of  the  stressor.  Depending  on  the  goals  of  the  risk  assessment,  the  specific 
identity  of  the  stressor  may  nor  may  not  be  important  to  know.  It  may  just  be  necessary 
to  know  that  a  characteristic  of  the  stressor  causes  the  effect.  Demonstrating  direct  cau¬ 
sation  will  not  be  possible  in  many  situations,  particularly  those  in  which  benthic  com¬ 
munities  are  surveyed;  however,  integrating  field  observations  with  laboratory 
experiments  and  developing  weight-of-evidence  approaches  will  help  support  the  hy¬ 
pothesis  that  changes  in  benthic  populations  or  communities  are  a  direct  result  of  sedi¬ 
ment  contaminants. 

The  ability  to  demonstrate  a  cause-and-effect  relationship  between  specific  stressors  and 
responses  is  influenced  by  1)  understanding  a  stressor’s  mode  of  action;  2)  complexity  of 
the  suite  of  stressors  (z>.,  multiple,  possibly  interacting,  modes  of  action);  3)  ability  to 
measure  exposure  at  scale  and  form,  relevant  to  an  endpoint;  4)  ability  to  discriminate 
between  natural  variability  (in  space  and  time)  and  contaminant-induced  response;  and 
5)  other  confounding  factors. 

12.4.2  Mode  of  action 

The  strength  of  the  cause-effect  relationship  will  be  affected  by  response-  and  stressor- 
related  factors.  Some  assessment  endpoints  are  directly  associated  with  a  specific  type  of 
stressor,  so  these  endpoints  may  be  useful  for  showing  cause-and-effect  relationships.  For 
example,  increased  production  of  metallothionein  is  generally  indicative  of  metal  expo¬ 
sure,  whereas  increased  primary  productivity  is  often  associated  with  nutrient  additions 
or  other  nonspecific  factors.  However,  theses  endpoints  may  not  respond  to  other  stres¬ 
sors  that  underlie  ecological  changes  in  the  sediment  of  concern.  Other  endpoints  com¬ 
monly  used  in  sediment  studies  may  respond  to  a  wide  variety  of  stressors  {e.g.,  growth 
can  change  in  response  to  contaminants,  food  limitation,  temperature,  or  other  factors; 
community  metrics  may  be  responsive  to  contaminants,  habitat  characteristics,  physical 
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disturbance,  interactions  among  species).  These  general  responses  can  provide  a  broad 
measure  of  stress,  but  inferring  causation  may  be  difficult.  Thus,  a  trade-off  exists  be¬ 
tween  endpoints  wit  high  sensitivity  to  specific  stressors  and  those  that  are  sensitive  to  a 
wide  variety  of  stressors. 

12.4.3  Complexify  of  stressors 

Certain  types  of  stressors  may  be  directly  tied  to  specific  effects,  especially  if  only  a  single 
stressor  is  present  or  the  stressors  are  spatially  distributed  along  a  linear  gradient.  This 
situation  is  usually  the  exception  rather  than  the  rule.  Most  sediment  assessments  have 
the  difficult  task  of  differentiating  among  multiple  stressors  (/>.,  mixtures  of  chemicals), 
between  chemical  contamination  and  habitat  modification,  or  among  stressors  distrib¬ 
uted  patchily  in  space  and  time.  Suter  (1993)  introduces  several  terms  to  classify  the  cu¬ 
mulative  effects  of  multiple  stressors: 

•  Nibbling:  “The  cumulative  effects  of  a  number  of  actions  which  have  similar 
small  incremental  effects...”  (p.  372) 

•  Time-crowded:  “...actions  [that]  are  so  close  in  time  that  the  system  has  not 
recovered  from  effects  of  one  before  the  next  one  occurs...”  (p.  373) 

•  Space- crowded:  “...actions  [that]  are  so  close  in  space  that  the  areas  within  which 
they  can  induce  effects  overlap.”  (p.  373) 

•  Indirect  “The  cumulative  effects  that  occur  when  the  direct  effects  of  actions  are 
not  space-  or  time-crowded,  but  their  indirect  effects  are.”  (p.  373) 

Separating  the  component  impacts  of  multiple  stressors  is  a  difficult  task,  and  methods 
to  address  this  issue  are  discussed  in  Chapter  16,  Bedford  and  Preston  (1988),  and 
Gosslink  et  al  (1990). 

12.4.4  Measurement  of  exposure 

Measurement  of  exposure  is  the  most  problematic  issue  in  defining  cause  and  effect  of 
sediment-associated  contaminants.  The  problem  lies  in  defining  the  dose  to  the  organism 
in  the  dose-response  relationship.  Sediment-associated  contaminants,  in  contrast  to 
aqueous  contaminants,  are  subject  to  a  greater  magnitude  and  range  of  factors  that  influ¬ 
ence  their  bioavailability  to  organisms.  The  toxicity  paradigm  for  ecological  testing  has 
generally  used  environmental  concentration  as  the  dose  to  which  the  organism  is  ex¬ 
posed.  For  aquatic  toxicity  tests,  this  stems  from  the  proportional  relationships  between 
the  dose  at  the  receptor,  the  dose  in  the  organism,  and  the  external  concentration.  These 
relationships  are  more  difficult  to  establish  for  sediment  exposures  because  of  many  fac¬ 
tors  that  can  alter  contaminant  bioavailability.  In  addition,  behavioral  responses  of  or¬ 
ganisms  to  contaminants  may  alter  exposure  (Kukkonen  and  Landrum  1994;  Landrum 
et  al  1994).  However,  in  these  cases,  the  dose  in  the  organism  {i.e.,  the  tissue  concentra¬ 
tion)  often  is  proportional  to  the  observed  response  and  may  be  a  better  indicator  of  ex¬ 
posure  (Landrum  etal  1994). 

Factors  that  influence  the  bioavailability  of  sediment-associated  contaminants  include 
character  of  the  sediment,  characteristics  of  the  contaminants  and  behavior  of  the  organ- 
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ism  (Hamelink  et  al.  1994).  Thermodynamic  equilibrium  models  have  proven  useful  in 
predicting  the  bioavailability  of  hydrophobic  organic  contaminants  (Di  Toro  etal.  1991) 
and  some  metals  in  sediment  (Di  Toro  et  al  1990;  Ankley  et  al  1994;  DeWitt  et  al,  in 
press),  particularly  for  homogeneous  and  temporally  undisturbed  sediments.  Under 
more  variable  conditions,  simple  equilibrium  relationships  may  break  down  (Harkey, 
Landrum,  and  Klaine  1994;  Landrum  etal  1994, 1995).  For  example,  if  concentrations  of 
contaminants  or  normalizing  factors  (z>.,  OC  or  AVS)  are  vertically  or  horizontally  strati¬ 
fied  within  the  sediment,  organisms  burrowing  through  such  strata  will  likely  experience 
differences  in  exposure  over  relatively  small  spatial  scales  that  may  not  be  reflected  in  the 
usual  bulk  measurements  of  sediment  properties  or  chemistry  (Lee  1991).  Additionally, 
the  behavior  of  benthic  organisms  may  affect  their  exposure  as  a  consequence  of  1)  feed¬ 
ing  mode,  which  can  affect  the  routes  of  exposure  {le.,  deposit  versus  filter-feeding; 
Harkey,  Lydy  et  al  1994);  2)  bioturbation,  which  can  change  local  sediment  geochemis¬ 
try  and  thus  alter  local  contaminant  bioavailability;  or  3)  dispersal  ability,  which  can  alter 
the  duration  of  exposure  to  contaminated  sediments.  These  factors  can  confound  the  es¬ 
timation  of  exposure  for  organisms  in  the  field  when  exposure  is  based  simply  on  either 
a  whole  sediment  or  some  normalized  concentration.  Consequently,  assessments  of  ex¬ 
posure  within  sediments  would  be  improved  by  including  tissue-level  concentrations  as 
well  as  measurements  (or  predictions)  of  the  bioavailable  fraction  of  contaminants  in 
sediments. 

12.4.5  Separating  natural  background  variation  from  anthropogenic 
variation 

Natural  systems  are  inherently  variable  in  space  and  time.  In  general,  if  we  look  closely 
enough  or  take  enough  samples,  we  will  almost  always  be  able  to  detect  differences  be¬ 
tween  two  locations,  regardless  of  the  degree  of  anthropogenic  stress.  Therefore,  one  of 
the  most  important  issues  in  establishing  causal  relationships  between  sediment  con¬ 
taminants  an  ecological  effects  is  to  separate  natural  spatial-temporal  variation  from 
anthropogenic  variation.  For  example,  benthic  macroinvertebrate  communities  are 
strongly  influenced  by  substrate  composition  (grain  size,  percent  OC;  Sanders  1960).  On 
a  larger  scale,  it  is  well  established  that  structure  and  function  of  streams  vary  naturally 
from  upstream  to  downstream  (Vannote  etal  1980).  Although  this  longitudinal  variation 
is  predictable,  it  also  must  be  accounted  for  when  designing  SERAs. 

Understanding  natural  variation  is  the  key  to  separating  it  from  anthropogenic  variation. 
The  best  understanding  is  gained  form  long-term  studies  over  relatively  broad  spatial 
scales,  such  as  those  conducted  at  the  Experimental  Lakes  Area  (ELA)  in  northern  On¬ 
tario  (Heckey  et  al  1994)  or  at  the  National  Science  Foundation  s  Long-Term  Ecological 
Research  (LTER)  sites  (Callahan  1984). 

Experimental  manipulation,  either  in  the  laboratory  or  in  the  field,  is  the  most  direct 
method  of  separating  natural  from  anthropogenic  variation.  For  example,  laboratory 
studies  can  be  conducted  to  measure  the  direct  effects  of  a  particular  confounding  vari¬ 
able  {e.g.,  grain  size,  OC)  on  survival,  growth,  or  reproduction  (DeWitt  et  al  1988).  Simi- 
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larly,  microcosm  or  mesocosm  experiments  can  be  used  to  measure  the  influence  of  con¬ 
founding  variables  on  community  structure  or  ecosystem  processes. 

Well-designed  field  studies,  with  appropriate  reference  sites  and  true  replicates,  can  also 
be  used  to  understand  the  relative  importance  of  natural  variation.  Multiple  regression 
models  and  multivariate  procedures  can  be  developed  to  estimate  the  importance  of 
potential  confounding  variables  on  benthic  community  endpoints  (e.g.,  Clements  1994), 
One  potential  limitation  of  these  field  procedures  is  that  large  amounts  of  data  are  gen¬ 
erally  necessary  to  produce  robust  models. 

12.4.6  Confounding  factors 

Noncontaminant  factors  can  confound  the  response  of  a  measurement  endpoint  and 
lead  to  incorrectly  ascribing  high  environmental  risk  to  a  stressor  of  concern.  For  ex¬ 
ample,  survival  of  some  marine  amphipods  can  be  significantly  affected  by  the  grain  size 
of  the  sediment  (DeWitt  et  al  1988)  or  ammonia  concentration  in  the  pore  water  (Ankley 
et  al  1990;  Kohn  et  al  1994).  Failure  to  recognize  these  factors  can  lead  to  erroneous  at¬ 
tribution  of  toxicity  to  persistent  contaminants  present  in  the  sediments  at  sublethal  con¬ 
centrations.  Other  confounding  factors  within  sediment  habitats  include  both  abiotic 
0>.,  salinity,  temperature,  pH,  dissolved  oxygen,  hydrogen  sulfide,  alkalinity,  hardness, 
light)  and  biotic  parameters  {le,,  food  limitation,  predators,  competitors,  disease,  bio¬ 
genic  disturbance).  Typically,  confounding  factors  depress  the  measurement  endpoint 
response  {le.,  reduce  survival,  decrease  growth);  however,  the  opposite  can  also  occur, 
such  as  when  a  test  organism  fails  to  burrow  into  sediment  because  of  anoxic  conditions 
or  is  less  exposed  to  sediment-associated  contaminants  and  therefore  experiences  greater 
than  expected  survival  or  growth.  In  laboratory  studies,  use  of  appropriate  controls  and 
monitoring  of  water  quality  conditions  can  greatly  reduce  the  influence  of  confounding 
factors.  In  field  studies,  however,  confounding  factors  may  be  much  more  difficult  to 
account  for  because  of  spatial  and  temporal  variability.  Knowledge  of  the  physiological 
tolerances  of  the  species  of  interest  and  their  interactions  with  other  organisms  at  the 
study  site  can  provide  inferential  guidance  to  the  potential  influence  of  confounding  fac¬ 
tors  in  the  field,  but  manipulative  field  experiments  may  be  required  to  unequivocally 
measure  the  impact  of  these  confounding  factors. 

12.4.7  Assessment  of  causation 

Several  laboratory  and  field  approaches  may  be  used  to  establish  cause-and-effect  rela¬ 
tionships  between  stressors  and  endpoints  (Table  12-1).  Of  particular  relevance  is  the  use 
of  TIE  procedures  which  can  be  defined  as  “a  stepwise  process  that  combines  toxicity  test¬ 
ing  and  analysis  of  the  physical  and  chemical  characteristics  of  samples  to  identify  poten¬ 
tially  causative  toxicants”  (USEPA  1985;  and  see  Chapter  16).  Laboratory  toxicity  tests 
provide  the  greatest  control  over  confounding  variables  and  are  highly  replicable.  How¬ 
ever,  laboratory  toxicity  tests  have  greatly  limited  spatial  and  temporal  scales  and  lack 
ecological  realism.  Field  biomonitoring  approaches  may  be  conducted  over  broader  spa¬ 
tial  and  temporal  scales  but  are  poorly  controlled  with  regard  to  confounding  variables. 
Controlled  mesocosm  studies  provide  the  highest  degree  of  environmental  realism  and 
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Table  12-1  Usefulness  of  laboratory  and  field  approaches  for  assessing  cause-and-effect  relationships 
between  stressors  and  endpoints 


Approach 

Control  of 

confounding 

variables 

Replicability 

Spatial- 

temporal 

scale 

Ecological 

realism 

Laboratory  toxicity  tests 

high 

high 

low 

med 

Site-specific  biomonitoring 

low 

med 

med 

high 

Large-scale  biomonitoring  studies 

med 

med 

high 

high 

Paleoecology 

low 

low 

high 

med 

Natural  experiments 

med 

low 

med 

high 

Ecosystem  models 

med 

high 

high 

med 

Microcosms/mesocosms 

med 

med 

med 

med 

Ecosystem  manipulations 

low 

low 

high 

high 

may  be  conducted  over  relatively  broad  spatial  and  temporal  scales,  but  these  approaches 
also  lack  control  over  confounding  variables.  As  strength  of  inference  is  related  to  a 
weight-of-evidence  approach,  consistency  of  relationships  among  replicates  in  laboratory 
studies;  correspondence  between  field  evidence,  laboratory  experiments,  and  published 
literature;  and  biomonitoring  information  should  all  be  utilized  to  make  sound  decisions 
in  SERAs. 

12.5  Predictive  and  retrospective  inference 

12.5.1  Introduction 

Assessment  endpoints  in  SERAs  usually  have  some  relevance  in  terms  of  legal  mandates, 
regulatory  policies,  stakeholder  interests,  and/or  broad  societal  values  because  risk  man¬ 
agers  are  involved  in  the  process  of  selecting  such  endpoints.  However,  during  the  risk 
assessment  process,  the  question  becomes  one  of  ecological  relevance:  In  what  ways  are 
the  measurement  endpoints  that  are  available  from  the  literature  or  potentially  available 
from  ad  hoc  testing  and  measurement  activities  relevant  to  the  chosen  assessment  end¬ 
points? 

Types  of  measurement  endpoints  from  which  inferences  are  made  and  the  types  of  infer¬ 
ence  to  be  made  can  be  either  predictive  or  retrospective.  Inferences  can  be  made  from 
single  measurement  endpoints  or  combinations  of  measurement  endpoints.  Inference 
from  a  single  measurement  relies  on  models  relating  the  measurement  endpoint  to  the 
assessment  endpoint,  whereas  statistical,  mathematical,  or  conceptual  inferences  from 
multiple  lines  of  evidence  use  logical  or  rule-based  weight  of  evidence. 
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12.5.2  Inference  from  biological  survey  endpoints 

Biological  surveys  are  activities  that  sample  biological  populations  or  assemblages  to 
estimate  their  properties.  For  example,  the  mean  number  of  species  per  m^  the  total 
number  of  species,  and  the  mass  of  macroinvertebrates  per  m^  are  potential  endpoints 
from  a  survey  of  benthic  macroinvertebrates  conducted  by  collecting  multiple  Ponar 
samples  at  sites  on  a  transect.  It  is  often  assumed  that  the  organisms  collected  from  sedi¬ 
ment  have  been  chronically  exposed  to  the  contaminants  in  those  sediments  because  the 
invertebrate  biota  are  sedentary. 

All  inferences  from  biological  surveys  involve  some  comparison  to  a  reference  state.  This 
may  be  done  in  a  number  of  ways,  and  the  choice  influences  the  types  of  inferences  that 
can  be  made. 

•  Transect  methods  are  quire  useful  when  contaminants  have  a  distinct  source 
(e.g.,  an  outfall  or  tributary)  or  location  (e.g,  a  spoil  mound).  In  such  cases,  the 
reference  is  the  portion  of  the  transect  in  which  the  measurement  endpoint 
achieves  the  asymptotic  condition.  This  approach  requires  reasonable  unifor¬ 
mity  of  the  physical-chemical  properties  of  the  sediment  in  order  to  avoid 
confusion.  In  theory,  one  could  identify  relevant  physical-chemical  properties 
and  eliminate  their  influence  if  their  distribution  was  independent  of  the 
distribution  of  contaminants,  but  such  an  approach  is  not  likely  to  succeed  in 
practice. 

•  Where  transects  are  not  practical,  community  properties  from  the  contami¬ 
nated  site  must  be  compared  to  those  of  less-contaminated  sites.  If  biological 
surveys  are  to  provide  the  sole  or  primary  line  of  inference,  then  it  is  important 
to  have  multiple  reference  sites  in  order  to  answer  the  question  “Is  the  contami¬ 
nated  site  similar  to  or  different  from  uncontaminated  sites?”  rather  than  “Are 
the  two  groups  of  sites  similar  or  different?”  The  latter  question  is  of  little 
interest  since  two  sites  may  differ  for  any  number  of  reasons  other  than  con¬ 
tamination.  If  the  assessment  is  to  rely  primarily  on  sediment  toxicity  testing  or 
single-chemical  toxicology,  then  less  rigorous  designs  may  be  used  to  support 
the  results  of  those  inferences.  For  example,  if  the  sediment  is  toxic  to  the  test 
organisms  and  is  contaminated  to  levels  that  are  estimated  to  result  in  toxic 
exposures,  then  the  fact  that  the  species  richness  of  the  contaminated  site  and  a 
reference  site  differs  by  30%  serves  to  support  those  inferences  even  though  it 
has  little  utility  alone. 

•  A  survey  of  reference  communities  is  sometimes  not  necessary.  If  a  sediment 
supports  only  a  few  taxa  that  are  known  to  be  resistant  to  a  contaminant,  then 
little  need  exists  to  expend  resources  on  sampling  and  characterizing  reference 
communities.  It  is  sufficient  to  assure,  based  on  knowledge  of  habitat,  that  the 
sediment  is  suitable  for  a  different  sort  of  community  in  the  absence  of  contami¬ 
nation. 

•  Paleoecological  references  may  be  used  if  sediments  are  bedded  so  that  current 
community  composition  can  be  compared  to  predisturbance  composition  at  the 
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same  location.  Inference  is  limited  by  confounding  historical  changes  such  as 
changes  in  sediment  texture  or  frequency  of  anoxia. 

Biological  survey  results  are  seldom  used  in  predictive  assessments,  and  it  is  unlikely  that 
they  would  be  used  as  the  sole  line  of  evidence  for  inferring  effects.  However,  inference 
from  similarly  contaminated  sites  to  a  site  that  is  proposed  for  contamination  is  prudent, 
given  the  many  uncertainties  associated  with  predicting  the  responses  of  benthic  com¬ 
munities  to  contaminated  sediments.  For  example,  the  best  evidence  concerning  the  risks 
to  the  benthic  community  of  a  proposed  disposal  site  could  be  surveys  of  the  effects  of 
disposal  of  sediments  from  prior  dredging  of  that  harbor.  The  relevance  in  this  case  is 
limited  by  changes  in  the  contaminant  composition  of  the  sediment  and  the  ability  of 
available  survey  techniques  to  measure  properties  that  are  closely  related  to  the  assess¬ 
ment  endpoint. 

Measurement  endpoints  that  are  direct  estimators  of  the  assessment  endpoint  can  be 
selected  when  planning  biological  surveys.  For  example,  if  the  assessment  endpoint  is  a 
reduction  in  species  richness  of  benthic  invertebrates  in  a  river  reach,  the  measurement 
endpoint  may  be  mean  species  richness  in  samples  from  that  reach.  The  inference  is 
largely  statistical  (e.g.,  How  representative  are  the  samples,  and  what  summary  statistics 
best  describe  the  sampled  community?)  but  also  includes  assumptions  about  methods 
(e.g.,  the  consistency  of  taxonomic  identification).  Inferences  from  these  measurement 
endpoints  to  assessment  endpoints  are  relatively  simple  and  direct  in  most  cases  because 
assessment  endpoints  correspond  to  conventional  measurement  endpoints.  However, 
recall  that  inferences  about  causation  cannot  be  made  from  these  measurement  end¬ 
points  alone. 

It  is  also  possible  to  measure  an  intermediate  stage  in  an  indirectly  induced  effect  and  to 
model  the  effects  on  the  assessment  endpoint  (e.g.,  measure  invertebrate  abundance  and 
model  the  effects  of  the  loss  of  food  on  benthic  fish). 

12.5.3  Inference  from  biomorkers  and  gross  pathologies 

Biomarkers  include  various  biochemical  and  histological  measures  of  the  internal  state  of 
organisms  (Melancon  1995).  They  are  seldom  used  with  benthic  invertebrates  but  are 
used  with  fish,  including  epibenthic  species.  Gross  pathologies  include  deformities,  le¬ 
sions,  and  tumors.  Tumors  and  fin  erosion  have  often  been  associated  with  contaminated 
sediments. 

When  predicting  effects  of  future  contamination,  it  is  more  useful  to  do  a  toxicity  test 
that  measures  responses  such  as  mortality,  growth,  and  reproduction  (endpoints  relevant 
to  population  and  community  properties)  than  to  test  for  biomarker  responses  because 
the  relationship  of  biomarkers  to  organismal,  population,  or  community  effects  is  not 
clear.  Gross  pathologies  are  not  usually  included  in  toxicity  tests,  although  developmen¬ 
tal  abnormalities  are  relevant  to  population  effects  if  it  is  assumed  that  abnormal  organ¬ 
isms  will  not  reach  maturity. 
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Biomarkers  and  pathologies  have  no  known  ecological  relevance,  so  they  cannot  be  used 
alone  to  make  inferences  about  ecological  risks.  However,  they  may  be  used  as  support¬ 
ing  evidence.  For  example,  the  presence  of  gross  or  histological  pathologies  characteris¬ 
tic  of  a  particular  contaminant  or  class  of  contaminants  will  support  the  inference  that 
toxicity  is  a  cause  of  a  depauperate  fish  assemblage.  In  addition,  acetylcholinesterase  in¬ 
hibition  may  be  used  to  diagnose  death  resulting  from  organophosphate  pesticides. 

12.5.4  Inference  from  sediment  toxicity  test  endpoints 

Toxicity  tests  that  expose  organisms  to  whole  sediments  that  have  been  contaminated  in 
situ,  or  to  pore  waters  or  elutriates  derived  from  whole  sediments,  are  considerably  less 
well  developed  than  those  for  water.  Inference  from  the  existing  standardized  tests  is  lim¬ 
ited  by  the  small  number  of  sublethal  test  endpoints,  particularly  reproductive  end¬ 
points,  and  the  lack  of  systematic  validation  relative  to  field  effects.  Currently,  we  know 
that  sediments  that  kill  organisms  in  the  laboratory  are  likely  to  show  depauperate  com¬ 
munities  in  the  field,  but  we  do  not  know  that  benthic  communities  that  are  significantly 
affected  in  the  field  inhabit  sediments  that  would  be  toxic  in  the  standard  tests.  Like  bio¬ 
logical  surveys,  inference  is  limited  by  sampling  issues  {i.e.,  the  ability  of  the  tested 
samples  to  represent  the  sediments  to  which  the  endpoint  community  is  exposed)  and  by 
the  need  to  use  reference  sediments  in  whole  sediment  toxicity  tests. 

These  tests  use  contaminated  sediments  so  they  are  not  used  in  predictive  assessments. 
However,  they  can  be  useful  in  specific  circumstances.  For  example,  such  sediments  could 
be  used  to  determine  a  site-specific  exposure-response  relationship  if  a  contamination 
gradient  occurs  in  space.  That  relationship  could  then  be  used  to  predict  changes  in  time 
if  concentrations  are  changing  as  a  result  of  degradation  or  other  processes. 

Sediment  toxicity  tests  may  be  used  alone  to  infer  that  a  sediment  poses  unacceptable 
risks.  That  is,  if  amphipods  die  in  a  test,  then  it  is  reasonable  to  assume  that  organisms 
in  the  field  will  be  killed  by  the  sediment.  Inferences  could  be  improved  if  well-designed 
validation  studies  established  statistical  models  relating  test  endpoint  to  field  measure¬ 
ment  endpoint  that  correspond  to  common  assessment  endpoints.  Alternatively,  demo¬ 
graphic  models  or  individual-based  models  could  be  used  to  infer  population-level  effects 
from  measurement  endpoints  that  are  relevant  parameters  of  the  models  {e.g., 
survivorship  and  fecundity  for  the  demographic  models;  DeWitt  1994).  If  biological  sur¬ 
vey  data  are  available,  then  the  sediment  toxicity  data  need  not  be  used  to  make  infer¬ 
ences  about  the  effects  on  the  assessment  endpoint.  Instead,  sediment  toxicity  data  can 
be  used  to  confirm  that  the  effects  detected  by  a  survey  are  associated  with  toxicity,  so 
causation  can  be  inferred. 

12.5.5  Inference  from  single  chemical  toxicity  test  endpoints 

Traditional  toxicity  tests  involve  addition  of  individual  chemicals  to  a  clean  sediment  or 
water  to  generate  test  endpoints  such  as  LCSOs,  NOECs,  or  exposure-response  functions. 
The  inference  problems  of  contaminated  sediment  toxicity  tests  are  also  applicable  here. 
In  addition,  inference  from  these  tests  involves  assumptions  concerning  combined  toxic 
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effects  of  mixtures  (unless  one  contaminant  dominates)  and  exposure  (i.e.,  bioavailabil¬ 
ity  and  speciation)  that  are  avoided  by  contaminated  sediment  tests  (see  Chapter  18). 

Most  efforts  to  develop  models  to  extrapolate  to  assessment  endpoints  have  been  de¬ 
voted  to  endpoints  from  single  chemical  tests.  Various  types  of  models  have  been  devel¬ 
oped  for  this  purpose,  but  they  are  only  beginning  to  be  implemented  for  sediment 
community  endpoints  (Suter  1993).  Species-sensitivity  distributions,  regression  models, 
toxicodynamic  models,  and  other  types  of  extrapolation  models  that  have  been  devel¬ 
oped  for  estimating  effects  on  aquatic  populations  and  communities  from  conventional 
test  endpoints  can  be  applied  to  benthic  species  as  well.  Therefore,  inference  from  this 
class  of  measurement  endpoints  alone  currently  relies  on  conceptual  models  (e.g.,  in  the 
field,  if  the  LC50  is  exceeded,  organisms  will  die,  some  benthic  species  are  likely  to  go 
extinct,  and  species  richness  will  be  reduced). 

Use  of  single  chemical  test  endpoints  alone  in  retrospective  assessments  is  subject  to  the 
same  limitations  as  in  predictive  assessments.  If  the  contaminated  sediments  have  been 
tested  or  the  community  has  been  surveyed,  then  single  chemical  test  endpoints  can  be 
used  to  identify  the  chemicals  that  are  responsible  for  the  toxicity  and  to  infer  whether 
community  degradation  is  likely  as  a  result  of  a  particular  chemical  or  source. 

12.5.6  Inference  from  contaminants  in  biota 

Benthic  organisms  accumulate  contaminants  from  sediments  and  transfer  them  along 
trophic  webs  to  fish  and  wildlife.  Hence,  benthos  are  a  source  of  exposure  to  other  end¬ 
point  populations  and  communities,  as  well  as  being  an  endpoint  community  in  them¬ 
selves  (discussed  above).  Some  models  are  available  for  estimating  contaminant  uptake 
from  sediments,  but  the  complexity  and  diversity  of  sediments  makes  the  predictions 
highly  uncertain  and  not  applicable  to  all  classes  of  chemicals.  Therefore,  concentrations 
of  contaminants  of  concern  should  be  measured  in  both  the  biota  and  the  sediments  to 
which  they  are  exposed  along  with  the  various  laboratory  and  field  measurements  that 
serve  to  generate  measurement  endpoints  for  the  benthic  biota.  Concentrations  of  con¬ 
taminants  in  the  biota  can  be  used  to  estimate  risks  to  avian  and  mammalian  wildlife 
using  conventional  dietary  exposure  models  and  oral  toxicity  endpoints  (Suter  1993, 
USEPA 1994).  Risks  to  fish  that  feed  on  benthic  organisms  are  difficult  to  estimate  from 
concentrations  in  food  because  that  mode  of  exposure  has  received  very  little  consider¬ 
ation  (Woodward  et  al.  1994).  Thus,  concentrations  in  benthic  organisms  can  only  be 
used  as  supporting  information  to  establish  that  a  route  of  exposure  exists. 

12.6  Recommendations  and  critical  issues 

12.6.1  Reference  conditions 

Sediment  ecological  risk  assessments  involve  comparison  of  potential  stressors  with  one 
or  more  reference  conditions.  Establishment  of  appropriate  reference  conditions  has 
been  problematic  for  many  studies,  largely  because  of  a  lack  of  knowledge  concerning 
natural  ecological  processes.  The  following  recommendations  concern  the  establishment 
of  reference  conditions: 
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•  Contributions  to  the  systematics  (classification)  of  marine  and  freshwater 
benthic  organisms.  Systematics  is  at  the  base  of  our  understanding  of  the 
natural  world.  Unfortunately,  systematics  has  become  an  unfashionable  activity 
in  today  s  highly  technical  world,  and  our  knowledge  of  biodiversity  is  languish¬ 
ing  (Wilson  1992).  This  area  of  basic  biology  needs  more  —  not  less  —  atten¬ 
tion  in  SERAs.  In  addition,  the  taxonomic  levels  necessary  to  detect  in 
ecological  assessments  need  to  be  determined. 

•  Develop  regional  reference  bases  for  benthic  community  structure  in  fine¬ 
grained  depositional  sediments.  The  establishment  of  adequate  reference  or 
control  sites  has  long  been  a  difficulty  in  field-based  investigations.  Three 
different  approaches  have  recently  been  developed  to  circumvent  this  problem. 
The  regional  reference  site  approach  (e.g.,  Plafkin  et  al  1989)  uses  a  biocriteria 
approach,  whereas  the  RIVPACS  model  from  the  UK  (Wright  et  al  1984, 1988) 
and  the  BEAST  model  from  Canada  (Reynoldson  etal  1995)  use  multivariate 
approaches.  Similar  results  can  be  achieved  by  conceptual  clustering  (Matthews 
and  Hearne  1991;  Landis  etal  1994),  a  nonmetric  technique.  Whatever  method 
is  used,  regional  sites  should  be  established  and  sampled  before  more  anthropo¬ 
genic  disturbance  occurs  and  their  usefulness  as  “non-impacted”  sites  is 
eliminated. 

12.6.2  Exposure 

Exposure  to  sediment-associated  contaminants  is  complicated,  and  our  current  level  of 
understanding  limits  the  probability  of  an  accurate  risk  assessment  for  sediment-associ¬ 
ated  contaminants.  The  following  recommendations  would  improve  our  overall  under¬ 
standing  of  exposure  to  contaminants: 

•  Guidance  should  be  developed  to  assist  regulators  in  selecting  ecologically 
relevant  endpoints  to  reflect  the  appropriate  routes  and  levels  of  exposure. 

•  Better  understanding  of  exposure,  appropriate  for  each  level  of  trophic  organi¬ 
zation  needs  to  be  developed,  to  account  for  differences  in  the  interactions  of 
benthic  species  (both  animals  and  plants)  with  sediments. 

•  Methods  are  needed  to  characterize  exposure  to  individual  stressors  from  the 
many  different  natural  and  anthropogenic  stressors  to  which  benthic  organisms 
are  exposed.  Methods  are  also  needed  to  improve  our  ability  to  predict  exposure 
to  contaminant  mixtures. 

•  Better  ability  is  needed  to  predict  the  effects  of  body  residues  on  benthic  biota. 
Improving  relationships  between  accumulation  of  contaminants  and  expected 
effects  will  improve  both  interpretation  of  bioaccumulation  data  and  our 
confidence  to  explain  the  cause  of  observed  ecological  disturbances. 

•  Better  methods  are  required  to  assess  the  influence  of  benthic  community 
structure  and  function  (and  variation  in  both)  on  contaminant  concentrations 
and  bioavailability.  Benthic  organisms  (especially  deposit  feeders  and 
bioturbators)  can  substantially  alter  the  biogeochemistry  of  sediments  they 
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inhabit,  yet  little  is  know  about  the  activity  of  theses  organisms  on  the  fate  of 
sediment-bound  contaminants. 

12.6.3  Effects 

Ecologically  relevant  responses  to  sediment-associated  contaminants  is  complicated,  and 
our  current  level  of  understanding  limits  the  probability  of  an  accurate  ERA  for  sediment- 
associated  contaminants.  The  following  recommendations  would  improve  our  overall 
understanding  of  responses  to  contaminants: 

•  A  need  exists  to  determine  whether  SQGs  (e.g.,  SQC)  are  protective  or  ecologi¬ 
cally  relevant  within  the  context  of  SERAs. 

•  The  current  suite  of  sediment  toxicity  tests  and  their  ability  to  predict  effects  on 
populations  and  communities  in  diverse  biogeographic  locations  should  be 
systematically  validated.  The  results  of  theses  studies  should  be  used  to  direct 
the  development  of  new  sediment  toxicity  tests  and  to  assess  and  develop 
methods  to  extrapolate  from  sediment  toxicity  tests  to  endpoint  properties  in 
the  field. 

•  Existing  QSARs  should  be  systematically  created  and  evaluated  for  phylogeneti- 
cally  diverse  sets  of  benthic  animals  and  plants. 

•  Mechanisms  underlying  contaminant  effects  at  different  levels  of  ecological 
organization  and  linkages  among  them  need  to  be  understood.  Furthermore,  a 
need  exists  to  understand  how  contaminants  affect  ecological  change  through 
time  (trajectory). 

•  Better  methods  need  to  be  developed  to  measure  spatial  and  temporal  variation 
in  structural  and  functional  properties  of  benthic  communities  and  to  measure 
and  predict  the  trajectory  of  change  (temporal)  of  ecological  variables,  espe¬ 
cially  at  community  and  ecosystem  levels.  Studies  are  needed  to  better  under¬ 
stand  how  spatial  and  temporal  patchiness  in  ecological  responses  affect 
prediction  and  detection  of  impacts. 

•  Better  understanding  is  needed  of  direct  effects  of  contaminants  on  inter-  and 
intraspecific  interactions,  succession,  and  other  community-level  processes,  and 
on  ecosystem-level  processes  (nutrient  or  energy  cycling,  metabolism,  produc¬ 
tivity). 

•  Methods  need  to  be  developed  to  evaluate  the  effects  of  mixtures  of  toxicants  in 
laboratory  experiments  and  in  field  studies,  and  the  relative  importance  of  an 
individual  stressor  from  suites  of  multiple  stressors  in  field  studies. 
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_ Chapter  13 

Issues  of  uncertainty  in 
ecological  risk  assessments 

William  J  Warren-Hicks  Jonathan  B.  Butcher 


13.1  Introduction 

Uncertainty  analyses  seek  to  describe  and  interpret  lack  of  knowledge  that  may  be 
present  in  the  implementation  or  interpretation  of  a  risk  analysis.  Depending  on  the 
specific  goals  of  the  risk  analysis  and  on  the  availability  of  information,  the  uncertainty 
analysis  can  take  many  forms.  For  example,  the  analysis  can  be  quantitative  or  qualita¬ 
tive,  predictive  or  retrospective,  model-based  or  empirical,  experimental  or  survey-ori¬ 
ented.  The  number  of  available  approaches  for  documenting  and  assessing  the 
magnitude  of  uncertainty  is  large.  Given  the  array  of  goals  for  risk  analyses  and  the  need 
to  explicitly  address  uncertainty  in  a  decision-making  context,  developing  methods  to 
effectively  document,  calculate,  and  communicate  uncertainty  in  risk  analyses  is  an  area 
of  major  research  for  the  future.  With  the  current  emphasis  on  documenting  uncertainty, 
we  hope  that  the  use  and  consideration  of  uncertainty  in  risk-based  decisions  will  con¬ 
tinue  to  evolve  and  improve.  In  this  paper,  we  address  some  of  the  issues  in  risk  uncer¬ 
tainty  analysis  and  attempt  to  cover  a  broad  spectrum  of  issues  and  approaches 
appropriate  for  addressing  uncertainty  in  risk  methods.  The  issues  discussed  in  this  pa¬ 
per  apply  to  many  areas  of  risk  analysis,  including  risk  methods  used  to  assess  contami¬ 
nants  associated  with  sediments. 

13.2  The  problem  of  uncertainfy 

Because  risk  assessment  is  a  decision-oriented  process,  the  USEPA  and  other  organiza¬ 
tions  have  promoted  uncertainty  analysis  as  a  tool  for  addressing  the  expected  variation 
in  environmental  response  to  risk-based  decisions.  As  an  example,  USEPA  endorses  un¬ 
certainty  analysis  in  the  Framework  For  Ecological  Risk  Assessment  (USEPA  1992).  Un¬ 
certainty  analyses  will  become  ingrained  within  the  scientific  community  when 
decision-makers  become  aware  of  the  importance  of  explicitly  addressing  measures  of 
accuracy,  precision,  and  reliability  of  scientific  information  available  for  risk-based  deci¬ 
sions  (Chapter  17).  The  information  can  take  the  form  of  either  qualitative  or  quantita¬ 
tive  measurements  of  chemical,  biological,  or  toxicological  data,  including  information 
on  the  accuracy  and  precision  of  risk  estimates. 

Uncertainty  analyses  can  be  straightforward  or  complicated;  they  can  be  easily  imple¬ 
mented  or  oriented  toward  solving  complex  equations  requiring  significant  computing 
power.  There  are  no  typical  or  accepted  methods  for  conducting  an  uncertainty  analysis, 
and  proponents  exist  on  both  sides  of  the  argument  for  uncertainty  analysis  standardiza- 

199 


SET  AC  Press 


200 


ECOLOGICAL  RISK  ASSESSMENT  OF  CONTAMINATED  SEDIMENTS 

tion.  Such  confusion  can  frustrate  the  investigator,  possibly  resulting  in  badly  conducted, 
or  simply  ignored,  uncertainty  analyses.  These  problems  affect  both  the  regulatory  and 
regulated  communities.  For  example,  in  the  United  States,  the  need  for  and  interpreta¬ 
tion  of  uncertainty  analyses  have  been  the  basis  of  legal  actions  {e.g.,  uncertainty  in  tox¬ 
icity  testing  endpoints  used  in  NPDES  permits;  Warren-Hicks  and  Parkhurst  1992). 

The  goal  of  an  uncertainty  analysis  is  to  provide  the  risk  manager  with  the  most  complete 
information  available  on  the  expected  outcomes  of  management  options.  Thus,  the 
analysis  provides  information  on  not  just  the  most  likely  outcome,  but  the  whole  range 
of  potential  outcomes  resulting  from  a  management  decision.  Uncertainty  in  risk  meth¬ 
ods,  procedures,  and  data  should  not  negate  their  use;  rather,  assessing  the  degree  of 
uncertainty  should  provide  information  and  insight  useful  for  selecting  alternative  man¬ 
agement  actions  using  these  tools.  A  correct  treatment  of  uncertainty  provides  risk  man¬ 
agers  with  a  stronger  scientific  basis  for  making  risk-based  decisions.  Risk  analysis  is 
concerned  with  evaluating  current  and  future  risk  caused  by  environmental  perturba¬ 
tions,  and  the  response  of  the  environment  to  candidate  remediation  options.  Within  the 
risk  characterization  process  (USEPA 1992),  an  uncertainty  analysis  may  be  used  to  iden¬ 
tify  particular  components  and  sources  of  error  for  the  purpose  of  risk  reduction,  ulti¬ 
mately  leading  to  the  selection  of  a  remediation  alternative  in  the  risk  management  phase 
of  the  risk  assessment. 

Analysis  of  uncertainty  provides  valuable  information  that  can  aid  risk  abatement  deci¬ 
sions,  for  example,  estimating  the  probability  of  success  (or  risk  of  failure)  of  a  remedia¬ 
tion  strategy.  The  amount  of  uncertainty  in  the  expected  outcome  can  be  balanced 
against  the  need  to  ensure  environmental  protection  with  a  relatively  high  degree  of  con¬ 
fidence.  For  example,  in  some  cases  the  uncertainty  in  a  biological  impact  may  be  too 
large  to  implement  management  options.  As  a  consequence,  the  manager  may  defer  ac¬ 
tion  and  gather  additional  information  rather  than  initiate  costly  pollution  controls  that 
may  or  may  not  achieve  the  desired  level  of  environmental  protection.  If  both  the  uncer¬ 
tainty  and  costs  of  controls  are  high,  adopting  relatively  low-cost  remediation  strategies 
for  an  interim  period  may  be  wise  while  additional  monitoring  occurs.  On  the  other 
hand,  delaying  implementation  of  the  remediation  may  indeed  result  in  increased  envi¬ 
ronmental  damage,  and  possible  public  outcry.  Large  uncertainty  in  the  available  data 
and  information  does  not  mean  environmental  damage  is  not  occurring.  Damage  may  be 
occurring,  but  we  cannot  isolate  the  degree  of  damage  in  the  available  information. 
Choosing  the  level  uncertainty  that  is  acceptable  for  the  decision  at  hand  is  a  task  for  the 
risk  manager. 

In  any  case,  overlooking  uncertainty  in  risk-based  decisions  can  lead  to  the  illusion  that 
the  scientific  information  upon  which  the  decisions  are  based  is  adequate,  when  in  fact 
the  data  exhibit  larger  than  acceptable  variability,  or  can  result  in  wrongly  delaying  reme¬ 
diation  action  when  in  fact  the  variability  in  the  risk  estimates  are  unexpectedly  small.  In 
either  case,  risk  managers  may  exhibit  surprise,  or  disappointment,  when  the  environ¬ 
mental  outcomes  are  substantially  different  from  expectations  (Reckhow  and  Chapra 
1983). 
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The  amount  of  uncertainty  in  risk  estimates  that  is  acceptable  within  a  risk  analysis 
framework  can  reflect  a  policy  decision  for  a  regulator  or  specific  industry  {e.g.,  the  data 
quality  objective  process;  USEPA 1986)  or  can  reflect  an  individual  s  level  of  comfort.  For 
example,  a  risk  manager  (or  organization)  may  be  a  risk  taker  (i.e.,  willing  to  accept  a 
high  level  of  uncertainty  in  outcome  in  the  selection  of  an  optimal  management  decision) 
or  may  be  risk  averse  requiring  a  significant  margin  of  safety,  so  that  the  uncertainty 
in  achieving  the  desired  outcome  is  small,  despite  potentially  higher  costs).  This  concep¬ 
tual  development  of  risk  as  a  decision  science  is  extensively  applied  in  business  and  en¬ 
gineering  departments  in  the  United  States.  With  the  emphasis  on  environmental 
decision-making  in  the  ecological  risk  framework,  the  concept  of  risk  aversion/risk  accep¬ 
tance  (see  Winkler  1972)  is  growing  in  popularity  (Morgan  and  Henrion  1990). 

In  risk  analysis,  scientific  uncertainty  derives  from  many  sources,  including  inadequate 
scientific  knowledge,  natural  variability,  measurement  error,  sampling  error,  and  incor¬ 
rect  assumptions.  Uncertainty  also  arises  from  model  mis-specification,  including  errors 
in  statistics,  parameters,  initial  conditions,  and  failure  to  appropriately  capture  expert 
judgment.  The  outcome  from  an  analysis  of  uncertainty  can  be  a  probability  distribution 
on  the  quantity  of  interest  or  a  ranking  scheme  based  on  expert  judgment.  In  some  cases, 
simple  estimates  of  uncertainty,  such  as  the  standard  error  of  the  mean,  may  be  relevant; 
more  complex  measures,  such  as  prediction  intervals,  odds  ratios,  marginal  density  func¬ 
tions,  Bayes  factors  (see  Berger  1985),  and  robust  estimates  of  centrality  and  variance 
(Gilbert  1987)  may  also  be  appropriate.  Unfortunately,  this  large  array  of  possible  ap¬ 
proaches  and  methods  to  assessing  uncertainty  can  inhibit  the  understanding  and  inter¬ 
pretation  of  the  analysis  results,  thereby  effectively  reducing  the  utility  of  the  uncertainty 
analysis  in  risk-based  decision-making. 

In  the  final  analysis,  we  must  overcome  the  dilemma  of  uncertainty  if  the  risk  process  is 
to  succeed  and  become  an  integral  part  of  our  scientific  assessments  of  the  environment. 
As  always,  we  can  look  toward  better  education  and  increased  scientific  awareness  of  the 
importance  of  uncertainty  in  risk-based  decision-making.  But,  uncertainty  analysis  will 
become  ingrained  in  our  risk  paradigms  only  when  decision-makers  find  value  in  the 
process  and  realize  the  usefulness  of  the  information  gained  in  the  analysis. 

13.2.1  Some  viewpoints  on  uncertainty 

Given  the  variety  of  statistical  techniques,  models,  and  approaches  available  for  uncer¬ 
tainty  analysis,  it  is  easy  to  understand  why  measures  of  uncertainty  are  frequently  over¬ 
looked  or  minimized  in  practice.  In  fact,  some  assessment  of  uncertainty  enters  any 
decision  process,  and  it  may  be  treated  either  explicitly  or  implicitly.  In  some  cases,  a  risk 
manager  may  be  presented  with  a  qualitative  analysis  of  uncertainty  which  enables  an 
informed  evaluation  of  candidate  management  options.  In  other  cases,  a  risk  manager 
presented  with  a  point  estimate  of  risk  might  decide  that  the  risk  analysis  has  low  (or 
high)  reliability  and  might  assign  it  little  (or  great)  weight  in  the  formation  of  a  manage¬ 
ment  decision  (implicitly  assigning  an  evaluation  of  uncertainty  associated  with  the  point 
estimate). 
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An  assumption  frequently  underlying  statistical  measures  of  uncertainty  {e.g.,  mean 
square  error)  is  that  the  lack  of  precision  about  the  measurement  endpoint  or  model 
predictions  can  be  reduced  with  more  data.  This  derives  from  classical  statistical  theory 
where  the  variance  of  the  sample  mean  is  a  function  of  the  number  of  replicate  samples. 
With  large  sample  sizes  (greater  than  20  to  30),  the  expected  reduction  in  uncertainty 
about  the  mean  with  larger  sample  sizes  is  usually  found  in  practice.  However,  with  small 
or  moderate  sample  sizes,  additional  data  can  easily  increase  the  realized  sample  variance 
in  the  mean.  In  small  samples,  each  individual  value  is  relatively  influential,  therefore  an 
extreme  data  point  (a  sample  far  from  the  calculated  mean)  can  easily  result  in  a  large 
calculated  variance  and  a  comparably  large  shift  in  the  mean  value.  This  problem  empha¬ 
sizes  the  need  for  calculating  and  interpreting  standardized  measures  of  variance  and 
centrality  (like  the  mean  and  variance  of  a  normal  distribution).  In  response  to  such  is¬ 
sues,  investigators  should  realize  that  a  variety  of  statistical  measures  exists,  each  with 
their  own  advantages,  for  describing  the  characteristics  of  data  {e.g.,  robust  measures  of 
centrality  such  as  the  trimmed  mean  [Gilbert  1987],  shrinkage  estimators  of  variance 
[Berger  1985],  and  Bayesian  methods  for  combining  data  [Warren-Hicks  and  Wolpert 
1994]). 

In  general,  uncertainty  is  the  absence  of  information  that  may  or  may  not  be  obtainable. 
In  this  regard,  the  concept  of  uncertainty  is  categorized  into  those  components  that  are 
reducible  and  those  components  that  are  not  reducible.  The  reducible  component  of 
uncertainty  is  associated  with  the  ability  to  predict  or  measure  a  single  value,  say  benthic 
species  richness  at  a  single  site  (here,  true  replicate  measurements  would  provide  a  mea¬ 
sure  of  sampling  error).  This  uncertainty  can  be  reduced  with  a  better  model,  improved 
method,  or  more  information.  In  practice,  we  never  reduce  all  sources  of  uncertainty  in 
an  assessment,  and  some  sources  of  variation  may  not  be  reducible.  For  example,  when 
testing  and  validating  a  selected  exposure  model,  natural  variability  in  flow  and  other 
hydrological  components  will  result  in  the  presence  of  model  prediction  error,  regardless 
of  the  quality  of  the  model  or  site-specific  measurements  used  in  model  testing.  The  in¬ 
vestigator  must  establish  the  degree  of  model  prediction  error  that  is  appropriate  for  the 
decisions  to  be  made  with  the  model  predictions.  This  viewpoint  of  uncertainty  is  consis¬ 
tent  with  USEPA’s  data  quality  objectives  process  (USEPA 1986;  Warren-Hicks  1989). 

13.2.2  Is  uncertainty  analysis  always  needed? 

Situations  exist  where  a  formal  uncertainty  analysis  may  not  be  useful  prior  to  obtaining 
additional  data.  In  general,  if  little  knowledge  exists  about  the  environmental  process 
under  investigation,  or  if  little  empirical  data  are  available,  then  the  information  gener¬ 
ated  from  a  prospective  uncertainty  analysis  is  limited  in  scope  and  interpretation.  A 
good  strategy  is  to  increase  the  amount  of  available  information  {e.g.,  gather  data  or  con¬ 
sult  experts)  prior  to  conducting  a  formal  analysis.  In  the  extreme,  if  you  are  completely 
uncertain,  it  may  not  be  possible  to  quantify  your  lack  of  understanding.  Of  course,  the 
extreme  case  where  no  information  of  any  kind  is  available  is  not  likely.  As  part  of  a  com¬ 
prehensive  strategy,  however,  a  risk  analyst  should  judge  whether  enough  information  is 
available  to  warrant  a  formal  uncertainty  analysis.  For  example,  if  a  mechanistic  com- 
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puter  model  is  built  completely  from  a  conceptual  understanding  of  an  environmental 
process  {e.g.,  uptake  of  PCBs  by  flounder  in  southern  estuaries)  for  which  no  data  exist, 
employing  Monte  Carlo  analysis  to  estimate  the  model  prediction  uncertainty  is  not  in¬ 
formative  as  to  the  variance  of  actual  environmental  concentrations.  The  model  cannot 
be  used  to  estimate  actual  environmental  concentrations  without  calibration  to  on-site 
conditions,  which  require  actual  measurements.  If  remediation  decisions  require  esti¬ 
mates  of  the  mean  and  range  of  PCB  concentrations  in  the  water,  then  the  uncertainty 
analysis  will  provide  only  misleading  information.  When  little  empirical  data  are  avail¬ 
able,  qualitative  or  expert  judgment  assessment  of  uncertainty  can  focus  the  data  collec¬ 
tion  effort.  The  risk  assessor  could  be  better  served  spending  resources  to  gather 
empirical  data  that  can  be  used  to  calibrate  and  validate  the  model.  After  model  calibra¬ 
tion  and  validation  with  empirical  evidence,  estimating  the  relative  ability  of  the  model 
to  accurately  predict  the  risk  to  the  environment  is  appropriate. 

13.3  Methods 

Approaches  to  uncertainty  are  briefly  discussed  below,  classified  into  three  areas: 

•  Margin  of  safety  (safety  factors) 

•  Statistical  paradigms 

•  Qualitative  approaches  to  uncertainty 

This  discussion  attempts  to  provide  insight  into  the  various  approaches  to  dealing  with 
uncertainty  that  are  generally  used  in  risk  assessments. 

13.3.1  The  margin  of  safely  (or  safely  factor)  approach 

The  margin  of  safety  approach  in  environmental  assessment  is  frequently  used  in  regu¬ 
latory  settings  (MacDonald  et  al  1995).  The  terminology  results  from  analogy  to  engi¬ 
neering  design  practice,  where,  for  instance,  a  bridge  is  designed  to  accommodate 
significantly  more  than  the  greatest  expected  load,  thus  incorporating  a  margin  of  safety 
into  the  specification.  For  example,  USEPA  uses  safety  factors  when  establishing  water 
quality  criteria  for  the  protection  of  aquatic  life  (Stephan  and  Erickson  1988);  the  com¬ 
puted  concentration  of  a  toxic  chemical  estimated  to  protect  95%  of  the  aquatic  commu¬ 
nity  is  divided  in  half  to  adjust  for  the  use  of  LC50  data  in  the  analysis.  The  CWA  requires 
a  margin  of  safety  in  the  estimation  of  total  maximum  daily  load  (TMDL)  for  the  attain¬ 
ment  of  designated  uses,  which  is  “normally  incorporated  into  the  conservative  assump¬ 
tions  within  the  calculations  or  models”  (USEPA  1991). 

In  practice,  the  margin  of  safety  approach  is  generally  environmentally  protective  and  is 
useful  when  little  or  no  data  are  available  for  more  quantitative  estimates  of  variance,  but 
provides  insufficient  basis  for  assessing  the  tradeoffs  between  costs  and  risk.  In  addition, 
when  there  are  many  sources  of  uncertainty  (e.g.,  sources  of  uncertainty  in  toxicity  test¬ 
ing  or  exposure  concentrations),  safety  factors  provide  a  relatively  easy  way  to  implicitly 
aggregate  the  multiple  sources  of  uncertainty.  In  this  case,  a  complicated  statistical  vari¬ 
ance  components  analysis  may  be  extremely  costly  and  impractical.  In  a  first-tier  risk 
analysis  attempting  to  screen  toxic  chemicals,  safety  factors  may  be  appropriate  under 
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the  assumption  that  the  screening  criteria  should  be  conservative  (Parkhurst  etal  1995). 
The  bi^est  drawback  of  blindly  applying  a  margin  of  safety  approach  is  that  it  may  result 
in  significant  over-design  and  waste  of  resources. 

If  the  goal  of  the  environmental  assessment  is  to  accurately  reflect  the  probability  of  en¬ 
vironmental  impact,  then  safety  factors  are  usually  inappropriate.  Here,  the  investigator 
is  focused  on  accuracy  and  precision,  and  not  on  conservative  estimates  of  risk.  In  a  risk 
assessment,  we  are  generally  interested  in  describing  the  potential  impact  with.as  much 
accuracy  and  precision  as  possible  (expect  possibly  during  a  screening-level  assessment 
in  which  conservative  assumptions  are  frequently  employed).  Therefore,  the  data  require¬ 
ments  are  higher,  and  the  requirements  for  high  prediction  accuracy  are  greater.  Conser¬ 
vative  safety  factors  should  not  be  used  with  these  goals  in  mind.  However,  conservative 
safety  factors  are  generally  appropriate  for  assessments  attempting  to  screen  nontoxic 
chemicals  from  further  analysis.  The  drawbacks  to  using  safety  factors  are  the  loss  of  in¬ 
formation  and  a  tendency  to  reduce  the  amount  of  effort  in  obtaining  additional  data. 
Incumbent  upon  the  risk  assessor  is  to  identify  the  goals  of  the  assessment  and  the 
amount  of  uncertainty  he/ she  is  willing  to  accept  before  using  the  safety  factor  approach 
to  uncertainty. 

13.3.2  Statistical  paradigms 

Below,  we  discuss  two  statistical  paradigms  for  uncertainty  analysis.  The  classical  statis¬ 
tical  approach,  or  frequentist  approach,  is  most  familiar  to  environmental  scientists.  The 
second  paradigm  is  based  on  Bayes  theory  and  has  a  long  history  in  engineering  and 
business  applications.  Bayesian  methods  are  associated  with  decision  theory  and  provide 
a  rigorous  fi'amework  for  combining  multiple  types  of  data  in  a  risk  analysis  {e.g.,  labora¬ 
tory  toxicity  data,  benthos  data,  and  measures  of  value  such  as  cost). 

73.3.2.7  Frequentist  paradigm 

Most  environmental  scientists  are  familiar  with  the  frequentist  approach  to  uncertainty 
estimation.  Frequentist  theory  is  based  on  the  long-run  expectations  of  statistical  estima¬ 
tors.  For  example,  the  standard  estimator  for  the  mean  of  a  distribution 

N 

1=1 

N 

can  be  shown  with  asymptotic  theory  to  converge  to  the  population  mean  (which  is  as¬ 
sumed  to  be  fixed,  but  in  reality  is  unknown).  This  theory  is  used  to  assign  the  above  es¬ 
timator  with  the  statistical  properties  of  being  both  unbiased  (convergence  in 
expectation  to  the  population  mean)  and  efficient  (smallest  variance).  These  desirable 
properties  apply  only  to  the  statistical  estimators,  not  to  the  actual  data  that  is  collected. 
Unfortunately,  any  single  biological  measurement  or  sample  is  neither  unbiased  nor  ef¬ 
ficient.  In  fact,  the  mean  from  a  small  random  sample  could  be  very  different  from  the 
population  mean.  Standard  measures  of  uncertainty  for  the  frequentist  are  usually  asso- 
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dated  with  the  normal  distribution,  for  example,  the  normal  variance,  normal  standard 
deviation,  and  standard  error  of  the  normal  mean.  Popular  methods  for  estimating  vari¬ 
ability  include  variance  estimates  based  on  the  normal  distribution,  Monte  Carlo  tech¬ 
niques,  and  first-order  error  analysis. 

Most  statistical  computer  software  includes  frequentist  estimates  of  variance.  Unfortu¬ 
nately,  the  proliferation  of  statistical  software  serves  to  promote  the  use  of  statistics  by 
untrained  individuals  not  familiar  with  the  sometimes  subtle,  but  important,  statistical 
theories  upon  which  the  results  depend.  For  example,  Monte  Carlo  methods  are  fre¬ 
quently  misused  and  their  underlying  assumptions  violated:  Figure  13-1  summarizes  two 
Monte  Carlo  analyses  of  the  same  risk  assessment  endpoint  and  shows  that  a  change  in 
the  specification  of  distributions  can  result  in  a  large  difference  in  the  estimation  of  risk. 
Hypothesis  testing  procedures  also  are  a  popular  frequentist  method  for  determining 
differences  between  population  means.  Again,  opinions  concerning  the  appropriate  use 
of  hypothesis  testing  methods  vary  widely  (Casella  and  Berger  1987;  Suter  1996). 

Uncertainty  in  model  predictions  can  also  be  directly  evaluated  based  on  uncertainty  or 
variability  in  model  inputs.  A  popular,  and  relatively  user-ffiendly,  method  for  propagat¬ 
ing  uncertainty  through  equations  or  models  is  first-order  error  analysis. 

First-order  analysis  can  be  written  as 


Sxi  Sxj  pXLXj 


where 


Xj  = 


Pxi.Xj 

Sv  S^.  = 

Xi,  Xj 

/  = 


a  model  parameter  estimated  from  the  data  (e.g.,  a  regression  coeffi¬ 
cient), 

the  correlation  between  the  /th  and  jth.  model  parameter, 
the  standard  deviation  of  the  individual  model  parameters,  and 
a  function  (e.g.,  the  regression  model,  or  mechanistic  model  used  for 
prediction). 


First-order  analysis  (like  Monte  Carlo  analysis)  requires  an  explicit  estimation  of  the  cor¬ 
relation  between  the  uncertain  parameters.  In  most  cases,  first-order  analysis  can  be 
implemented  with  a  calculator  and  provides  a  relatively  straightforward  means  of  propa¬ 
gating  error  from  multiple  sources  (e.g.,  parameters  in  a  multivariate  regression  model) 
into  the  error  in  model  prediction.  Applications  of  first-order  analysis  can  be  found  in 
Reckhow  and  Chapra  (1983). 

Although  there  are  many  arguments  for  and  against  the  classical  frequentist  approach  to 
uncertainty  estimation,  the  most  compelling  reason  for  using  the  frequentist  measures  is 
that  most  of  the  scientific  community  is  familiar  with  the  methods.  The  methods  have 
been  adopted  in  most  environmental  programs,  are  generally  understood,  and  usually 
are  accepted  as  an  appropriate  means  of  data  analysis. 
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Figure  13-1  Use  and  abuse  of  Monte  Carlo  analysis 
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73.3.2.2  Bayesian  paradigm 

Another  approach  to  the  evaluation  of  uncertainty  is  the  Bayesian  approach  (Press  1989). 
Bayesian  methods  are  based  on  Bayes’  theorem,  which  can  be  written  as  follows: 


P(0 1  y)  -  the  distribution  of  the  parameters  conditional  on  the  observed  data, 
referred  to  as  the  posterior  distribution,  which  contains  all  current 
information; 

L(y  1 0)  =  the  likelihood  function  based  on  the  sample  data,  containing  all 
information  available  in  the  current  sample;  and 
7t(0)  =  the  prior  distribution  of  the  parameters  before  observing  the  data;  the 
distribution  can  be  based  on  subjective  judgment  or  actual  data. 

Here,  parameters  {e.g.,  measurement  endpoints,  or  parameters  in  a  predictive  model)  are 
treated  as  random  variables  (unlike  classical  statistics  in  which  the  population  mean  is 
fixed  but  unknown).  In  the  Bayesian  concept,  the  posterior  probability  distribution  ex¬ 
presses  a  “degree  of  belief”  about  the  variable  of  interest,  based  on  prior  information 
obtained  before  data  collection  (the  prior),  and  the  sample  data  (the  likelihood)  collected 
during  the  experiment  or  monitoring  study.  The  prior  and  likelihood  function  are  com- 
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bined  using  Bayes  theorem  to  form  a  revised  estimate  of  the  parameters  (the  posterior 
distribution),  which  contains  all  of  the  available  information  about  the  parameters  (un¬ 
like  classical  statistics,  which  infers  information  about  the  characteristics  of  the  param¬ 
eter  based  on  the  expected  value  of  future  data,  which  is  the  long-run  frequency  concept). 
Bayesian  methods  can  be  used  to  incorporate  expert  judgment  and  information  from 
historical  sources  into  the  estimation  process  via  the  prior  distribution.  Bayesian  meth¬ 
ods  form  the  basis  of  the  decision  sciences,  which  have  historically  been  used  and  devel¬ 
oped  in  engineering  and  business  applications.  In  addition,  Bayesian  methods  can  be 
used  to  assess  the  risk  associated  with  alternative  actions  such  as  site  remediation  strat¬ 
egies  (Reckhow  1994),  to  explicitly  incorporate  costs  of  various  alternative  strategies  into 
the  decision  process,  and  to  evaluate  the  value  of  collecting  further  information  (Dakins 
e/ a:/.  1994). 

Figure  13-2  presents  a  conceptual  output  from  a  Bayesian  decision  approach.  While  this 
figure  may  be  difficult  to  develop  in  practice  (mostly  because  of  the  lack  of  appropriate 
information),  the  information  depicted  on  the  graph  is  implicitly  incorporated  into  most 
risk  management  decisions.  Decision  analysis  combines  the  probability  distribution  of 
the  risk  endpoint  of  interest  with  a  function  representing  the  net  benefits  (or  net  loss,  or 
utility)  that  occur  across  the  range  of  the  risk  endpoint.  The  benefits  are  evaluated  for 
several  management  options.  In  our  example,  decision  analysis  seeks  to  maximize  the  net 
benefits  of  management  options,  given  the  uncertainty  in  the  risk  endpoint.  The  prob¬ 
ability  distribution  needed  from  the  analysis  is  effectively  the  denominator  of  Bayes  theo¬ 
rem  (called  the  predictive  distribution).  The  benefits  function  need  for  the  analysis  is 
typically  difficult  to  define.  For  example,  valuing  the  benefit  of  “no  species  loss”  can  be 
difficult.  In  fact,  the  benefit  for  a  fisherman  is  probably  different  than  that  for  an  indus¬ 
try  using  the  water  for  cooling.  Generally,  risk  managers  do  not  explicitly  generate  ben¬ 
efit  functions  when  making  decisions.  However,  implicitly,  managers  weigh  the  amount 
of  information,  the  degree  of  uncertainty,  and  the  relative  benefits  of  management  op¬ 
tions  when  making  decisions.  Decision  theory  provides  a  formalized  way  to  incorporate 
all  of  this  information  in  a  rigorous  fashion. 

Given  the  data,  decision  analysis  provides  a  tool  for  rigorously  combining  information 
into  a  probabilistic  assessment  that  incorporates  uncertainty  and  is  useful  for  aiding 
management  decisions.  Figure  13-2  shows  the  probability  of  risk  (X-axis;  e.g.,  the  prob¬ 
ability  of  species  loss  in  an  area  of  contaminated  sediment)  and  a  measure  of  net  benefits 
or  value  (Y-axis;  the  sum  of  positive  benefits,  e.g.,  revenue  due  to  recreational  use  includ¬ 
ing  good  public  relations,  and  negative  benefits,  such  as  costs  of  treatment  or  fines  im¬ 
posed  by  regulatory  agencies).  In  addition,  from  the  figure,  a  risk  manager  can  find  the 
remediation  strategy  that  minimizes  the  expected  loss,  given  the  uncertainty  in  the  risk 
estimate.  Note  that  the  uncertainty  in  the  site-specific  species  loss  leads  to  several  treat¬ 
ment  strategies  depending  upon  the  risk  the  manager  is  willing  to  take.  In  the  figure,  no 
treatment  appears  to  be  optimal  at  expected  species  loss;  however,  for  a  risk-averse  deci¬ 
sion  (right-hand  tail  of  the  distribution  of  species  loss),  the  treatment  option  gives  greater 
net  benefits.  Practical  applications  of  this  approach  are  available  in  Reckhow  (1985). 
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Figure  13-2  Expected  net  benefit  from  management  actions 


13.3.3  Qualitative  estimates  of  uncertainty 

An  important  source  of  information  in  most  risk  assessments  is  expert  judgment.  In 
those  cases  where  little  monitoring  data  are  available,  expert  judgment  plays  an  ever- 
increasing  role  in  uncertainty  estimation.  Expert  judgment  can  be  incorporated  into  the 
risk  process  in  many  places  (Meyer  and  Booker  1991),  including  exposure  assessment, 
effects  assessment,  and  risk  characterization.  We  encourage  risk  assessors  to  incorporate 
expert  judgment  in  a  rigorous  fashion,  either  through  Bayes  theory  (Winkler  1967, 1977, 
1978, 1980;  Kadane  et  al  1980)  or  through  rigorously  designed  experiments.  While  ex¬ 
pert  judgment  is  implicitly  a  part  of  all  risk-based  decisions,  a  formal  approach  provides 
a  level  of  credibility  that  is  frequently  lacking  in  many  applied  applications  of  risk  assess¬ 
ment.  For  example,  Winkler  (1977, 1978, 1980)  developed  interview  methods  for  elicit¬ 
ing  expert  opinion.  Because  of  the  form  of  the  questions,  he  is  able  to  develop 
quantitative  estimates  of  variance  surrounding  the  endpoint  of  interest. 
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13.4  Conclusions 

In  most  ecological  applications,  uncertainty  analysis  can  provide  useful  information  to 
risk  decision  managers  with  respect  to  the  quality  of  information  available  for  making 
decisions.  Many  of  the  issues  in  uncertainty  are  briefly  discussed,  or  referenced,  above. 
We  hope  that  uncertainty  analysis  becomes  an  integral  part  of  the  risk  framework  and 
that  the  concept  of  uncertainty  and  variability  become  institutionalized  in  the  risk  assess¬ 
ment  process  as  practiced  by  both  industry  and  the  regulated  community. 
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14.1  introduction 

Assessing  biological  exposures  to  contaminants  is  an  essential  step  in  understanding  the 
toxicity  risks  from  contaminated  sediments.  Total  contaminant  concentrations  in  sedi¬ 
ments  can  be  directly  determined  and  are  the  simplest  measure  of  exposure.  However, 
the  bioavailability  of  the  contaminant  controls  the  dose  that  organisms  actually  receive. 
Predictions  of  risk  from  contaminated  sediments  will  be  most  relevant  when  exposure 
and  dose  are  both  understood.  This  requires  accurate  methods  for  predicting  bioavail¬ 
ability  in  a  site-specific  manner  and  for  linking  bioavailability  to  toxicity,  as  the  latter  is 
manifested  in  nature. 

In  this  chapter,  we  evaluate  uncertainties  in  the  methods  and  models  available  for  de¬ 
scribing  contaminant  bioavailability  and  their  applicability  to  site-specific  risk  assess¬ 
ment.  We  use  correspondence,  response,  and  mechanisms  in  the  evaluation:  Do 
bioavailability  predictions  generally  correspond  with  observations  in  nature  {i.e.,  in 
broad  datasets)?  Do  organisms  respond  in  specific  instances  as  predicted  {e.g.,  short-term 
responsiveness  to  bioavailability  changes  in  a  field  experiment  or  after  an  event)?  Can  the 
methods  and  models  be  justified  on  the  basis  of  mechanistic  understanding?  We  con¬ 
clude  that  geochemically  robust  methods  have  been  developed  to  date,  but  important 
uncertainties  in  the  models  and  methods  stem  from  incomplete  consideration  of  biologi¬ 
cal  processes.  Including  biological  processes  in  the  hydrophobicity  model  has  advanced 
risk  predictions  for  organic  chemicals.  For  metals,  kinetic  models  that  consider  both  ex¬ 
ternal  geochemical  controls  and  internal  biological  processes  will  be  necessary.  Better 
understanding  of  tissue-residue-based  toxicity  may  be  a  necessary  supplement  for  link¬ 
ing  bioavailability  (dose)  and  toxicity. 

14.2  Predicting  exposure  from  total  concentrations 

Concentration  provides  a  first  order  control  on  the  ecological  risk  from  contaminants  in 
sediments.  Total  concentration  is  recommended  in  some  methods  as  the  basis  for  esti¬ 
mating  exposure  (Long  et  al  1995),  but  mechanistic  and  correlative  studies  have  long 
shown  substantial  uncertainties  in  this  approach  (Neely  et  al  1974;  Veith  et  al  1979; 
Luoma  1983, 1989).  For  example,  no  relationship  is  found  between  sediment  chemical 
concentrations  and  bioavailability  or  biological  effects  when  experiments  are  conducted 
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with  sediments  that  differ  widely  in  critical  geochemical  characteristics  (Luoma  and 
Jenne  1977;  DiToro  etal  1990, 1991). 

Quantitative  uncertainties  in  the  relationship  between  concentration  and  bioavailability 
can  be  variable  in  nature.  Significant,  even  strong,  correlations  between  bioavailability 
and  total  concentration  are  found  within  simple  gradients  or  among  geochemically  simi¬ 
lar  environments  (Bryan  1985).  It  is  when  comparisons  are  made  across  environments 
(or  across  complex  geochemical  gradients)  that  less  correspondence  is  found.  The  large 
datasets  from  estuaries  of  the  United  Kingdom  assembled  by  Bryan  and  co-workers  illus¬ 
trate  the  effects  of  complex  geochemistry  (Luoma  and  Bryan  1981;  Bryan  1985;  Bryan  et 
al.  1985;  Bryan  and  Langston  1992).  For  these  studies,  concentrations  of  metals  were 
determined  in  fine-grained  surface  sediments  (judged  to  be  oxidized  by  appearance)  as 
well  as  in  bivalve  and  polychaete  tissues  (three  of  the  most  commonly  employed  species 
are  shown  in  Table  14-1).  The  estuaries  included  a  wide  range  of  physical,  biogeochemi¬ 
cal,  and  pollution  conditions.  Data  ranges  for  all  elements  were  several  orders  of  magni¬ 
tude,  and  confounding  cross-correlations  among  independent  variables  were  minimized 
(Luoma  and  Bryan  1981).  Obvious  sources  of  bias  such  as  particle  size,  large  reduction/ 
oxidation  (redox)  differences,  or  dilution  of  tissue  concentrations  by  reproductive  tissue 
were  carefully  controlled. 

Predictions  of  bioaccumulation  from  sediment  concentrations  in  the  English  estuaries 
varied  among  elements  and  among  species  (Table  14-1).  No  significant  correlation  was 
observed  between  tissue  residues  and  sediments  in  a  few  instances  {e.g.,  Cd  in  the  poly¬ 
chaete  Nereis  diversicolor;  Cu  in  the  bivalve  Scrobicularia  plana).  Bio  availability  in  these 
cases  was  completely  unpredictable  from  total  metal  concentrations  in  sediments.  In 
other  cases,  the  uncertainty  in  predicting  bioaccumulation  from  only  sediment  concen¬ 
trations  was  not  as  great  as  some  mechanistic  studies  would  suggest.  Silver  and  lead  con¬ 
centrations  in  sediments  explained  about  half  the  variance  in  bioaccumulation  in  all 
three  species.  Results  for  Cd  and  Cu  were  variable  among  species,  suggesting  a  significant 
biological  contribution  to  the  uncertainties.  Tissue  residues  commonly  differed  by  ten¬ 
fold  at  median  sediment  concentrations,  even  where  half  of  the  variance  was  explained  by 
the  correlation  (Bryan  1985;  Bryan  and  Langston  1992).  Thus,  for  all  metals,  predictions 
of  bioavailability  from  concentration  alone  were  usually  well  outside  the  two-fold  crite¬ 
ria  for  accuracy  suggested  by  Landrum  et  al  (1992). 

14.3  Incorporating  bioovailobility  into  exposure  assessment 

It  is  difficult  to  quantify  the  uncertainty  in  many  estimates  of  bioavailability  because  criti¬ 
cal  processes  «re  not  fully  understood.  Knowledge  is  growing  along  a  similar  path  for 
metals  and  organic  chemicals,  although  the  latter  is  more  fully  developed.  Below  we  de¬ 
scribe,  briefly,  the  state  of  the  existing  methods  applied  to  exposure  assessments.  Equilib¬ 
rium  partitioning  (Di  Toro  et  al.  1991)  is  the  present  method  of  choice.  We  will  consider 
some  of  the  most  important  areas  of  uncertainty  in  EqP  and  use  it  as  a  basis  for  compari¬ 
son  to  other  methods.  Organic  chemical  bioavailability  and  the  associated  models  have 
been  widely  reviewed,  most  recently  by  Landrum  etal  (1992),  Hamelink  et  al  (1994), 
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Table  14-1  Percent  variance  in  bioaccumulation  of  Ag,  Cd,  Cu,  and  Pb  explained  by  variation  in 
sedimentary  concentrations  of  these  metals  (IN  HCl  extractions) 


Percent  variance  in  bioaccumulation 

Organism 

Ag 

Cd 

Cu 

Pb 

Nereis  diversicolor  (polychaete) 

41-62 

0-16 

72 

54 

Scrobicularia  plana  (bivalve) 

49 

29 

0 

50 

Macoma  balthica  (bivalve) 

69 

21 

10 

47 

Data  from  a  broad  range  of  estuaries  in  the  United  Kingdom. 

Regressions  published  in  Luoma  and  Bryan  1981,  Bryan  1985,  Bryan  and  Langston  1992. 


and  Meador  et  al.  (1995a);  we  will  try  to  minimize  redundancies  with  those  insightful 
discussions. 

14.3.1  Organic  chemicals 

Mackay  (1982)  suggested  that,  although  biologically  based  kinetic  models  were  the  tra¬ 
ditional  approach  to  describing  organic  contaminant  bioconcentration,  an  alternative 
was  to  “view  the  biotic  phase  {e.g.,  fish)  as  an  inanimate  volume  of  material  that  is  ap¬ 
proaching  thermodynamic  equilibrium  with  its  medium  {e.g.,  water)  as  defined  by  the 
chemical  potential  or  fugacity  of  the  bioconceritrating-persistent  solute.”  This  remains 
the  central  principle  guiding  risk  assessments  for  organic  chemicals  in  aquatic  habitats. 
Experimentally  derived  bioconcentration  factors  (where  bioconcentration  is  defined  as 
uptake  from  dissolved  phase)  for  different  organic  chemicals  were  related  to  the  hydro- 
phobicity  of  the  chemical  (Mackay  1982).  Bioaccumulation  (uptake  from  all  sources) 
from  sediments  was  not  necessarily  predictable  from  chemical  hydrophobicity  alone 
(Pereira  et  al.  1988;  Swackhamer  and  Hites  1988),  but  predictions  were  improved  by 
normalization  to  the  lipid  content  of  the  organisms  and  the  TOC  content  of  the  sedi¬ 
ments  {e.g,  Lake  et  al.  1990),  consistent  with  the  fugacity  theory.  Equilibrium  partition¬ 
ing  predicted  porewater  concentrations,  based  upon  hydrophobicity  and  concentrations 
of  the  chemical  normalized  to  OC  in  sediment  (Di  Toro  et  al.  1991).  Toxicity  was  pre¬ 
dicted  by  comparison  of  porewater  concentrations  with  dissolved  concentrations  that 
cause  toxicity  in  bioassays. 

Bioassays  that  manipulated  OC  in  sediments  provided  the  mechanistic  basis  for  predic¬ 
tions  of  bioavailability  to  benthos  by  EqP.  For  example,  Swartz  et  al.  (1990)  found  that 
EqP  predictions  explained  porewater  concentrations  in  three  sandy  marine  sediments 
spiked  with  fluoranthene.  Toxicity  to  two  amphipods  correlated  with  porewater  concen¬ 
trations,  although  the  species  differed  in  sensitivity.  Swartz  etal.  (1990)  cited  the  range  of 
toxicities  of  fluoranthene  to  be  8  to  62  (Jg/l;  comparable  to  12  to  41  pg/1  observed  in  their 
study  —  a  maximum  uncertainty  of  3X  to  4X. 
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Dissolved  organic  carbon  (DOC)  confounds  EqP  predictions  of  porewater  concentrations 
and  toxicity  of  organic  chemicals.  Equilibrium  partitioning  underestimated  porewater 
concentrations  of  fluoranthene  and  overestimated  toxicity  in  sediment  bioassays  where 
DOC  accumulated  in  pore  waters  (DeWitt  et  al  1992;  Suedel  et  al  1993).  The  uncertainty 
in  these  treatments  was  lOX  or  more.  Lower  porewater  concentrations  of  PAHs  than  pre¬ 
dicted  by  EqP  are  found  in  field  sediments  in  contrast  to  spiked  sediments;  the  source  of 
the  PAHs  may  be  one  cause  of  these  uncertainties  (Meador  et  al  1995b). 

Correlative  studies  of  organic  chemical  bioavailability  to  benthos  across  broad  geochemi¬ 
cal  gradients  have  received  little  attention.  In  one  of  the  few  field  studies  of  EqP  predic¬ 
tions,  Lake  et  al  (1990)  compared  PCB  concentrations  in  three  species  of  benthos  with 
EqP  predicted  accumulation  factors  (AF  was  lipid-normalized  tissue  PCB/TOC- 
normalized  sediment  PCB).  Animals  were  collected  from  New  Bedford  Harbor,  Long  Is¬ 
land  Sound,  and  Narragansett  Bay.  The  maximum  variability  in  the  aggregated  concen¬ 
tration  data  was  ~3X,  and  AFs  varied  by  3X  to  5X  between  the  molluscan  and  polychaete 
species,  apparently  due  to  species-specific  influences  on  bioaccumulation. 

14.3.2  Metals 

Some  consensus  exists  about  principles  that  guide  prediction  of  trace  element  bioavail¬ 
ability,  but  knowledge  uncertainties  are  probably  greater  than  for  organic  chemicals.  It 
is  clear  that  element  speciation  or  form,  not  total  metal  concentrations,  influence  bio¬ 
availability  of  both  dissolved  and  sediment-bound  trace  elements.  However, 
geochemical  controls  are  more  diverse  than  with  organic  contaminants.  The  primary 
influence  on  bioconcentration  (from  solution)  appears  to  be  the  activity  of  the  free  metal 
ion,  for  metals  that  behave  as  cations  (Cd,  Cu,  Pb,  Zn)  (Sunda  and  Guillard  1976;  Ander¬ 
son  and  Morel  1978;  Sunda  et  al  1978;  CampbHl  and  Tessier  1989).  Ag  does  not  follow 
this  principle  in  marine  waters  because  its  chloro-complex  is  bioavailable  (Engel  and 
Fowler  1989).  Methylation  of  some  elements  {e.g.,  Hg  and  Sn)  greatly  changes  bioavail¬ 
ability.  Oxidation  state  controls  the  bioavailability  of  elements  that  behave  as  anions  (Se, 
As,  Cr,  V). 

Tissue  accumulation  does  not  appear  to  be  driven  by  any  single  principle  analogous  to 
hydrophobicity.  Chemical  potential  (fugacity)  in  food  and  within  biotic  tissues  is  con¬ 
trolled  by  diverse  biochemical  reactions  and  a  variety  of  metal  forms  (e.g,  complexes  like 
metallothioneins  occur  in  cell  solution;  membrane-bound  forms  occur  within  and  out¬ 
side  cells,  as  do  granules  and  other  insoluble  bodies).  Fisher  (1986)  showed  that  differ¬ 
ences  in  bioconcentration  by  phytoplankton  among  cationic-behaving  metals  was 
explained  by  differences  in  hydroxyl  ligand  association  constants.  This  is  not  a  good  pre¬ 
dictor  of  bioaccumulation  in  animals,  however,  probably  because  of  the  diversity  of  the 
above  biochemical  reactions  within  organisms  and  in  the  food  sources  from  which  they 
assimilate  metals. 

Geochemical  processes  governing  metal  bioavailability  from  sediments  are  more  com¬ 
plex  than  for  organic  chemicals;  redox  state  is  important,  as  are  the  diversity  of  surfaces 
that  can  sorb  metals  (Jenne  1977).  In  the  laboratory,  bioavailability  from  oxidized  sedi- 
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merits  changes  with  the  sediment  component  to  which  the  metal  is  bound  (iron  oxides, 
manganese  oxides,  organic  materials,  carbonates;  Luoma  and  Jenne  1977).  Oxidized  es¬ 
tuarine  sediments  are  sufficiently  variable  in  nature  that  such  differences  in  metal  parti¬ 
tioning  among  such  components  might  be  expected  (Luoma  and  Davis  1983).  Formation 
of  sulfides  also  affects  metal  bioavailability.  Bioavailability  is  reduced  when  moles  of  AVS 
exceed  moles  of  weak  acid-extractable  metals  in  the  sediments  to  which  an  organism  is 
exposed  (Di  Toro  et  al  1990). 

Normalizations  or  correction  factors  were  derived  with  metals,  as  with  organic  contami¬ 
nants,  to  account  for  effects  of  sediment  geochemistry  on  bioavailability.  Simple,  single¬ 
factor  normalizations  of  sedimentary  Pb  (by  extractable  Fe),  Hg  (by  TOC),  As  (by 
extractable  Fe),  and  Cu  (by  extractable  Fe)  allowed  surprisingly  accurate  predictions  in 
large  datasets  from  geochemically  diverse  field  conditions  (Luoma  and  Bryan  1978; 
Langston  1980, 1982;  Tessier  etal  1984).  In  many  circumstances,  however,  factors  affect¬ 
ing  bioaccumulation  were  more  complex  than  accounted  for  by  simple  normalizations 
ie.g.,  Luoma  and  Bryan  1982;  Amyot  etal  1995). 

Di  Toro  et  al  (1990, 1991)  suggested  that  normalization  by  AVS  might  explain  varying 
metal  bioavailability  from  sediments.  Short-term  sediment  bioassays  showed  that  the 
molar  ratio  of  AVS/SEM  provided  a  first-order  control  on  activities  of  at  least  some  met¬ 
als  (Cd,  Ni)  in  pore  water  {e.g,,  Di  Toro  etal  1990;  Ankley  etal  1991).  Activities  were  re¬ 
duced  to  very  low  levels  at  ratios  <1,  because  of  the  high  stability  of  the  metal  sulfide. 
Toxicity  and  bioaccumulation  were  correlated  with  porewater  metal  activities  in  the  bio¬ 
assay.  Thus,  very  different  doses  resulted  if  two  sediments  had  the  same  total  metal  con¬ 
centrations  but  different  AVS  concentrations;  the  differences  were  corrected  by  AVS 
normalization. 

A  recent  field  experiment  in  a  lake  in  Quebec  illustrated  that  porewater  concentrations  of 
Cd  were  responsive  to  AVS  changes  in  nature  (Hare  et  al  1994).  Pretreated  sediments 
were  transplanted  in  trays  to  the  lake,  and  bioaccumulation  was  studied  in  organisms 
that  colonized  the  trays  of  sediment.  Porewater  Cd  concentrations  increased  as  excess  Cd 
increased  relative  to  AVS,  as  would  be  predicted  from  equilibrium  partitioning.  The  effect 
on  Cd  bioavailability  was  different  for  species  with  different  life  cycles,  however.  Organ¬ 
isms  that  diurnally  migrated  from  sediments  to  the  water  column  of  lakes  were  the  least 
influenced  by  enriched  porewater  metal  concentrations;  chironomids  that  live  buried  in 
sediments  were  the  most  influenced. 

Uncertainties  in  the  use  of  AVS  as  the  universal  sediment  normalizer  are  difficult  to  quan¬ 
tify  at  the  present  state  of  knowledge.  Studies  to  date  have  not  defined  how  to  determine 
AVS  concentrations  that  are  biologically  relevant  in  the  field.  Nor  have  they  confronted 
the  challenge  of  ensuring  that  the  AVS  concentrations  used  in  experiments  are  represen¬ 
tative  of  the  system  (or  risk  assessment  site)  of  interest.  Reduction/oxidation  reactions, 
and  thus  sulfide  concentrations,  are  patchy  on  biologically  relevant  micro-scales  within 
reduced  sediments.  AVS  varies  widely  with  depth  in  a  different  manner  in  every  sediment 
and  with  time  in  the  same  sediment  (see  Luoma  and  Ho  1993  and  Luoma  1995  for  re- 
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views  of  the  geochemical  literature  relevant  to  bioassays).  Sediments  collected  for  bioas¬ 
says  are  typically  taken  from  a  constant  depth,  such  as  the  surface  3  to  6  cm,  and  natural 
redox  gradients  are  homogenized  to  reduce  experimental  variability.  Typically,  limited 
numbers  of  samples  are  collected,  sometimes  from  a  single  type  of  depositional  area  in 
otherwise  complex  systems  (e.g.,  Brumbaugh  etal  1994).  Accurate  representation  of 
nature  will  require  systematic  characterization  of  AVS  variability  in  a  variety  of  circum¬ 
stance  (Table  14-2).  Ultimately,  standardized  methods  should  include  sampling  from  a 
depth  defined  by  site-specific  redox  conditions  and  by  the  redox  conditions  favored  in  the 
sedimentary  microhabitat  of  the  species  of  interest. 

Mechanistic  knowledge  of  sediment  geochemistry  suggests  factors  in  addition  to  AVS 
should  influence  porewater  metal  concentrations  and  metal  bioavailability  from  sedi¬ 
ments.  Most  macrofauna  have  an  obligate  requirement  for  oxygen.  Macrofaunal  and 
meiofaunal  life  concentrates  in  the  oxidized  zones  of  sediments,  where  thermodynamics 
do  not  favor  occurrence  of  sulfides.  On  the  other  hand,  the  relatively  slow  oxidation  kinet¬ 
ics  of  sulfides  suggest  some  sulfides  could  be  present  at  times  in  oxidized  sediments. 
Multi-ligand  equilibrium  models  have  successfully  predicted  porewater  concentrations 
and  bioavailability  of  metals  in  Canadian  lakes  (Campbell  and  Tessier  1989;  Belzile  et  al 
1990;  Tessier  et  al.  1993;  Amyot  et  ah  1994).  These  models  incorporate  binding  constants 
for  iron  oxides  and  organic  materials  but  do  not  consider  AVS.  Tessier  et  al  (1993) 
showed  a  strong  correlation  between  Cd  activity  in  pore  waters  (as  predicted  from  the 
model)  and  Cd  concentrations  in  a  bivalve.  Amyot  et  al  (1994)  used  multivariate  analy¬ 
sis  to  predict  metal  bioavailability  to  an  amphipod  from  two  lakes  using  the  oxic  equilib¬ 
rium  model  to  define  Cd  activities  in  solution.  They  could  explain  61%  to  81%  of  the 
variability  in  concentrations  of  Cd,  Cu,  Pb,  Ni,  and  Zn  in  the  amphipods  from  Cd  activ¬ 
ity,  pH,  and  Ca^^.  The  maximum  differences  between  predicted  and  observed  concentra¬ 
tions  were  3X  to  4X.  Seasonal  variability  was  poorly  explained,  possibly  because  of  the 
influences  of  biotic  variables  and  changes  in  diet  (Amyot  et  al  1994).  It  seems  likely  that 
AVS  and  oxic  surfaces  should  both  be  included  if  equilibrium  bioavailability  models  are 
to  be  employed  across  a  range  of  conditions. 

14.4  Are  equilibrium  models  sufficient  to  predict 
bioavailability? 

Where  multiple  factors  affect  a  process  like  bioavailability  (Table  14-3),  models  are  an 
effective  explanatory  and  predictive  tool.  The  simplicity  of  EqP  models  make  them  an 
attractive  choice  for  risk  assessment.  However,  they  are  also  characterized  by  incomplete 
or  incorrect  descriptions  of  some  fundamental  processes  and  a  limited  capability  for 
dealing  with  important  environmental  complexities  (Landrum  et  al  1992;  lannuzzi 
1995).  These  mechanistic  limitations  suggest  that  uncertainties  in  the  use  of  EqP  in  risk 
assessments  could  be  substantially  greater  than  the  3X  to  lOX  differences  found  in  the 
limited  assessments  of  variability  conducted  to  date.  Farrington  (1989)  concluded  that 
“numerous  field  data  and  experimental  studies  demonstrate  the  need  to  extend  the  equi¬ 
librium  theory  model....”  Landrum  et  al  (1992)  state  “...kinetic  models  are  needed  to 
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Table  14-2  Some  uncertainties  and  research  needs  in  defining  the  usefulness  of  AVS  normalization  for 
estimating  metal  bioavailability  in  sediments 


Uncertainty 

Problem 

Research  question 

Biological  relevance 
ofAVS 

Behavior/ecology  influences 
exposure 

Does  AVS  effect  vary  among  species  in 
equilibrated  sediment  systems? 

Does  AVS  vary  among  micro-habitats  in  a 
sediment? 

Spatially  representative 
sampling 

Redox  and  sulfide  concentrations 
are  patchy: 

•  Sediment  characteristics 

•  Depth  in  sediment 

•  Season 

•  Among  depositional  zones 
within  a  system 

Are  there  inaccuracies  due  to  sampling 
constant  depth? 

What  is  variability  within  a  system  (rivers, 
estuaries,  coastal  zones,  lakes)? 

How  does  AVS  vary  with  time  and  depth 
within  a  year? 

What  sediment  characteristics  most 
influence  AVS  in  oxidized  surface 
sediments?  In  surface  3  to  6  cm? 

predict  non-steady  state,  nonequilibrium  accumulation  from  temporally  and  spatially 
varying  exposures  when  the  simplifying  assumptions  of  equilibrium  partitioning  models 
are  inappropriate,  for  example,  when  multiple  sources  contribute  significantly  to  accu¬ 
mulation.” 

In  a  kinetic  bioaccumulation  model,  a  set  of  rate  expressions  is  used  to  relate  chemical 
fluxes  among  biological  “compartments”  and  the  environment  (McKim  and  Nichols 
1994).  Kinetic  models  are  more  flexible  and  specific  in  dealing  with  biological  complex¬ 
ity  than  are  equilibrium  models.  The  focus  is  on  the  biological  processes  that  influence 
contaminant  exposures,  and  responses  of  such  processes  are  employed  to  interpret 
geochemical  complexities.  The  greatest  drawback  of  kinetic  models  (and  other  alterna¬ 
tives  to  EqP)  is  that  they  could  require  extensive  data  for  predictive  application  to  specific 
circumstances  (Landrum  et  al  1992;  McKim  and  Nichols  1994).  A  critical  question  is 
whether  it  is  reasonable  to  develop  such  data.  Landrum  et  al  (1992)  evaluated  and  dis¬ 
cussed  the  mathematical  details  of  both  equilibrium  and  kinetic  models  available  to  aid 
exposure  assessments.  Physiological,  pathway,  and  food  web  models  with  both  kinetic 
and  equilibrium  components  have  also  been  described  (Clark  etal  1988;  Thomann  1989; 
Clark  et  al  1990;  Landrum  et  al  1992;  Bremle  et  al  1995;  Kidd  et  al  1995;  LeBlanc  1995). 

The  choice  of  model  for  exposure  assessment  should  account  for  at  least  the  following 
general  sources  of  uncertainty  that  plague  the  simpler  approaches:  1)  nonequilibrium 
conditions  (the  assumption  of  equilibrium  is  critical  to  EqP  but  not  necessary  for  all 
modeling);  2)  chemicals  for  which  the  simple  EqP  approach  is  unsuitable  {e.g.,  sulfides 
are  not  a  factor  affecting  bioavailability  of  some  trace  elements;  bioavailability  of  some 
organic  chemicals  is  not  predictable  from  hydrophobicity);  3)  different  pathways  of  up¬ 
take  (at  least,  models  should  assess  whether  multiple  pathways  of  uptake  add  uncertainty 
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Table  14-3  Some  biological  attributes  and  geochemical  factors  affecting  metal  bioavailability 
from  sediments 


Biological  attributes 

Geochemical  factors 

Feeding  rate 

Water  column  chemistry 

Filtration  rate  across  gills 

Type  of  surfaces  in  sediment 

Food  selection 

Oxidation/reduction  conditions 

Life-cycle  characteristics 

Particle/solution  partitioning 

Microhabitat 

Porewater  DOC  (or  other  factors  affecting  speciation) 

Behavior  with  regard  to  sediment 

Diffusion  and  other  reactions  at  microhabitat  scale 

Digestive  processing 

Biological  productivity 

to  exposure  assessment);  4)  food  web  transfer  of  contaminants  (sediment  risk  assessment 
should  extend  to  higher  trophic  levels  in  the  sedimentary  food  web);  and  5)  biological 
attributes  that  affect  bioavailability  to  a  target  or  sensitive  species.  A  diversity  of  experi¬ 
mental  designs,  beyond  the  short-term  bioassay  paradigm,  will  also  be  necessary  to  con¬ 
vincingly  assess  the  above. 

14.4.1  Nonequilibrium  conditions:  How  common? 

The  assumption  of  equilibrium  inherently  limits  the  circumstances  in  which  EqP  models 
can  be  applied  (Landrum  et  al  1992).  There  is  no  consensus  on  how  frequently  contami¬ 
nant  distributions  among  phases  are  governed  by  nonequilibrium  conditions  nor  about 
the  quantitative  error  that  arises  in  these  circumstances.  Two  of  the  circumstances  in 
nature  that  can  cause  distributions  of  contaminants  to  differ  from  those  predicted  at 
thermodynamic  equilibrium  are  high  temporal  variability  and  biological  control  of  dis¬ 
tributions. 

Key  processes  in  many  ecosystems  are  variable  on  several  time  scales.  In  estuaries,  for 
example,  metal  inputs  are  from  myriad  sources  (rivers,  atmosphere,  local  anthropogenic) 

ties,  dissolved  organic  matter,  resuspension,  hydrodynamics,  and  biological  cycles) 
change  on  several  scales  in  time  and  space.  Intensive  field  studies  in  estuaries  have  clearly 
demonstrated  the  highly  dynamic  nature  of  metal  exposures  (Luoma  1976;  Fisher  and 
Frood  1980;  Cain  and  Luoma  1990;  Luoma  et  al.  1990).  Rivers  also  undergo  changes  in 
physical  conditions,  contaminant  inputs,  and  internal  physicochemical  characteristics 
on  diurnal,  weekly,  and  seasonal  scales.  Again,  highly  unstable  metal  dynamics  can  be 
typical  in  such  ecosystems.  In  contrast,  it  seems  possible  that  some  lakes  and  coastal 
environments,  while  not  stable,  may  be  subject  to  a  regularity  in  physical-chemical 
change  more  amenable  to  maintenance  of  near-equilibrium  conditions. 
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Biological  processes  can  result  in  steady-state  or  varying  contaminant  distributions  that 
deviate  substantially  from  those  predicted  by  thermodynamic  equilibrium.  Wangersky 
(1986)  suggested  that  “control  of  concentrations  of  most  trace  metals  in  surface  waters 
appears  to  be  by  physical  and  chemical  adsorption  on  biological  materials,  and  is  related 
to  local  primary  productivity.”  Distributions  will  eventually  be  predictable  from  thermo¬ 
dynamic  principles  if  biological  controls  exert  their  effect  by  changing  the  abundance 
and  nature  of  surfaces  (although  biological  change  is  often  rapid).  For  example,  the  cy¬ 
cling  and  flux  of  particle-reactive,  non-essential  metals  is  predictably  linked  to  produc¬ 
tion  in  oceanic  systems  (Coale  and  Bruland  1985;  Fisher  et  al.  1987, 1988;  Moore  and 
Dymond  1988).  However,  thermodynamic  equilibrium  will  not  predict  distributions  and 
reactions  where  a)  biological  processes  add  new  intracellular  biochemical  sinks,  b)  biota 
create  and  release  products  not  favored  thermodynamically,  or  c)  biota  incorporate  con¬ 
taminants  into  structures,  compounds,  or  complexes  that  are  either  stable  or  break  down 
slowly. 

For  example,  concentrations  of  Se  species  in  the  oceans  are  explained  by  biological  pro¬ 
cessing,  not  by  thermodynamics.  Phytoplankton  strip  selenite  from  solution,  transform 
it  to  organo-selenium,  and  release  the  latter  both  to  solution  and  to  their  consumers 
(Wrench  and  Measures  1982;  Cutter  and  Bruland  1984;  Reinfelder  and  Fisher  1991; 
Luoma  et  al  1992).  It  is  well  known  that  dissolved  metals  (Cd,  Zn,  and  perhaps  Ni  and 
Cu)  are  “depleted”  by  biological  productivity  {i.e.,  concentrations  are  lower  than  expected 
on  the  basis  of  thermodynamics)  in  the  oceans  and,  at  least  at  times,  in  lakes  (Bruland 
1980;  Sigg  1985;  Reynolds  and  Hamilton-Tayler  1992).  Sunda  and  Huntsman  (1995) 
showed  that  Cu  concentrations  (particulate  and  dissolved)  in  oceanic  nutriclines  are 
regulated  by  phytoplankton  uptake  and  regeneration,  not  by  thermodynamics  alone.  Lee 
and  Fisher  (1992)  demonstrated  that  Cd  and  Zn  are  accumulated  in  the  cytosol  {i.e., 
within  the  cells)  of  phytoplankton,  while  Cu  and  Ag  are  predominantly  sorbed  to  particle 
surfaces.  As  a  result  of  such  reactions,  Cd  and  Zn  concentrations  were  rapidly  depleted 
from  solution  and  increased  in  suspended  materials  during  a  phytoplankton  bloom  in 
South  San  Francisco  Bay  (van  Geen  et  al,  USGS,  in  prep.).  Passage  of  metals  from  phy¬ 
toplankton  to  their  consumers  (Reinfelder  and  Fisher  1991)  or  recycling  of  the  metals 
from  decaying  phytoplankton  (Lee  and  Fisher  1992)  are  also  controlled  by  the  proportion 
of  metal  partitioned  to  the  cytosol  of  the  plants,  a  purely  biological  phenomenon.  Con¬ 
sumer  organisms  also  transform  contaminants.  For  example,  detoxification  of  metals  in 
the  tissues  of  animals  occurs  by  conjugation  into  stable  forms  like  granules  that  may  be 
of  low  bioavailability  to  predators  (Nott  and  Nicolaidou  1990;  Wallace  and  Lopez,  in 
press). 

Equilibrium-based  models  inherently  assume  a  stability  in  all  of  nature.  They  may  mis¬ 
state  exposures  where  biota  control  contaminant  partitioning  between  dissolved  and 
particulate  phases,  where  trophic  transfer  is  influenced  by  biochemically  transformed 
contaminants  or  where  physical-chemical  conditions  are  highly  dynamic.  Comparisons 
of  predictions  between  models  that  assume  equilibrium  and  those  that  do  not  could  be 
fruitful  in  testing  the  quantitative  importance  of  this  source  of  uncertainty. 
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14.4.2  Biological  attributes  that  affect  bioavailability:  Is  metal 
availability  biologically  generic? 

Differences  in  organismic  and  ecological  attributes  of  the  ecosystem  at  risk  are  often  ex¬ 
cluded  from  consideration  in  the  most  geochemically  robust  equilibrium  models 
Figure  14-1).  Multiple  pathways  of  uptake,  bioenergetics,  species-specific  differences  in 
mode  of  exposure  to  sediments  and  associated  waters,  physiological  control  of  uptake, 
differences  in  elimination  rates,  and  food  web  structure  are  potentially  important  but 
often  not  considered  (Clark  et  al  1988, 1990;  Thomann  1989;  Landrum  et  al  1992). 
While  the  conceptual  model  guiding  exposure  assessments  must  include  first-order 
geochemical  principles,  substantial  uncertainties  will  remain  in  specific  predictions  until 
a  robust  appreciation  of  biology  is  also  incorporated. 

Biological  attributes  could  affect  whether  AVS  is  a  relevant  control  on  bioavailability  to  a 
species,  for  example.  Benthic  species  obtain  oxygen  and  their  nutrition  differently  from 
sediments  and  are  exposed  to  different  microenvironments.  Some  oligochaetes  feed 
“head-down”  in  reduced  sediments  and  “breathe”  by  periodically  returning  to  the  oxi¬ 
dized  surface  of  the  sediments  (G.  Lopez,  SUNY  Stony  Brook,  personal  communication). 
These  organisms  would  be  predominantly  exposed  to  AVS-rich,  reduced  sediments  in 
many  environments,  probably  sediments  with  low  metal  bioavailability.  In  contrast, 
most  meiofauna  are  restricted  to  oxidized  layers  of  sediments,  where  metals  are  likely  to 
be  in  their  most  bioavailable  forms.  Many  macrofauna  use  tubes  or  burrows  to  feed  and 
obtain  oxygen  or  particulate  food  from  oxygenated  waters  above  or  near  the  interface 
between  oxidized  and  reduced  sediments,  even  though  their  bodies  are  buried  in  anoxic 
sediments.  As  cited  earlier.  Hare  et  al  (1994)  found  that  AVS-controlled  porewater  con¬ 
centrations  of  Cd  in  spiked  sediments  transplanted  to  a  lake,  but  differences  in  life  his¬ 
tory,  behavior,  and/or  ecology  determined  differences  in  the  Cd  exposure  of  different 
species. 

Different  species  ingest  different  foods,  and  feeding  can  change  within  species  in  re¬ 
sponse  to  their  environment  or  life  stage.  Differences  in  food  sources  affect  trace  element 
assimilation  (Decho  and  Luoma  1991, 1994;  Luoma  etal  1992;  Wang  et  al  1995;  Wang 
and  Fisher  1996);  less  is  known  about  effect  of  feeding  on  assimilation  of  organic  chemi¬ 
cals.  For  example,  Cr  was  generally  not  assimilated  by  bivalves  from  sediment  particles 
but  was  assimilated  with  --90%  efficiency  from  ingested  bacteria  (Decho  and  Luoma 
1991).  Assimilation  efficiencies  of  ingested  metals  varied  among  7  different  algal  diets  in 
the  mussel  {Mytilus  edulis)  (Wang  and  Fisher  1996)  and  changed  approximately  two  fold 
with  the  abundance  of  available  food  (greater  assimilation  at  lower  feeding  rates)  (Wang 
et  al  1995).  Decho  and  Luoma  (1996)  found  that  Cd  assimilation  varied  from  20  to  80% 
between  different  phytoplankton  or  bacteria;  only  5%  to  17%  of  Cd  was  assimilated  from 
sediments  (probably  with  a  minimal  living  component)  or  from  nonviable  particles  (iron 
oxides,  humic  substances,  fulvic  substances).  Deposit  feeding  clams  assimilated  -90%  of 
ingested  Se  from  diatoms  but  only  20%  of  the  Se  when  they  fed  on  sediments  containing 
microbially  deposited  elemental  Se  (Luoma  et  al  1992). 


SETAC  Press 


Session  6:  Critical  Issues  in  Methodological  Uncertainty,  Chapter  1 4 


221 


Figure  14-1  Forms  of  metal  to  which  benthic  organism  might  be  exposed  and  their  contributions  to 
dose  experienced.  All  potential  pathways  of  uptake  do  not  need  to  be  directly  considered 
(dotted  lines)  in  EqP  model  because  all  sources  are  in  equilibrium.  Fugacity  theory  implies 
that  common  process  controls  chemical  potential  in  all  equilibrated  phases  that  are  sources 
of  exposure.  This  is  more  appropriate  for  organic  chemicals  than  for  trace  elements. 
Alternative  models  (e.g.,  kinetic)  assume  all  pathways  are  additive;  therefore,  contribution 
of  each  must  be  quantified  to  correctly  estimate  dose. 


Contaminant  (and  especially  trace-element)  bioavailability  is  not  biologically  generic.  It 
is  not  practical  to  understand  all  biological  factors  for  all  species  {e.g.,  contaminant  as¬ 
similation  from  all  combinations  of  food  sources  available  to  all  benthos).  But  under¬ 
standing,  for  example,  assimilation  from  end  members  in  a  continuum  of  food  sources, 
and  generalizations  about  how  biology  and  ecology  affect  exposures  for  key  species,  may 
be  necessary  for  reliable  exposure  assessments.  The  models  employed  should  be  suffi¬ 
ciently  flexible  to  include  such  considerations. 

14.4.3  Pathways  of  uptake:  Additive  or  equilibrated? 

The  EqP  model  assumes  that  only  one  pathway  of  exposure  needs  to  be  defined.  “It  is 
assumed  that  the  organism  receives  an  equivalent  exposure  from  a  water-only  exposure 
or  from  any  equilibrated  phase;  either  from  pore  water  via  respiration;  from  sediment 
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carbon  via  ingestion;  or  from  a  mixture  of  the  routes”  (Di  Toro  et  al  1991).  Alternative 
models  assume  that  pathways  of  uptake  are  additive  (Thomann  1989;  Clark  et  al  1990; 
Landrum  et  al  1992).  This  fundamental  difference  in  conceptual  models  is  important. 
For  example,  the  applicability  of  water-only  bioassays  implicitly  depends  upon  the  as¬ 
sumption  of  equilibrium.  If  pathways  are  additive,  then  it  is  not  appropriate  to  extrapo¬ 
late  to  nature  from  single-pathway  bioassays.  Such  extrapolations  will  underestimate  the 
full  exposure  of  organisms  ingesting  bioavailable  contaminants  in  their  food  (Luoma 
1995;  Meador  et  al  1995a).  Luoma  et  al  (1992)  used  a  pathway  model  and  environmental 
data  to  show  that  Se  contamination  problems  in  San  Francisco  Bay  occurred  at  much 
lower  concentrations  than  predicted  by  toxicity  or  uptake  from  the  dissolved  phase 
alone.  The  dose  of  Se  that  organisms  received  from  the  dissolved  vector  was  very  low  at 
concentrations  of  available  forms  (selenite)  typical  of  the  Bay,  but  the  dose  received  via 
the  pathway  selenite-phytoplankton-clam-birds  was  sufficient  to  explain  the  environ¬ 
mental  contamination  (Brown  and  Luoma  1995).  Whether  or  not  the  system  was  in  equi¬ 
librium,  bioassay-based  criteria  could  not  account  for  the  additive  threat  achieved  via  the 
food  web. 

The  choice  of  models  also  can  affect  what  is  monitored  for  exposure  assessment.  If  EqP 
is  used  for  risk  assessment,  then  only  sediments  need  be  analyzed  for  the  parameters 
necessary  for  modeling.  If  pathways  are  additive  then  the  different  sources  (water,  sus¬ 
pended  particulates,  sediments,  or  prey  organisms)  must  be  monitored,  unless  their  in¬ 
significance  is  demonstrated. 

Contrary  to  the  observations  of  Luoma  et  al  (1992),  Kemp  and  Swartz  (1988)  found  that 
Cd  bioavailability  was  predictable  from  the  EqP  concepts.  Their  paper  is  one  of  the  im¬ 
portant  references  supporting  the  proposal  that  pore  water  is  the  significant  pathway  of 
Cd  bioaccumulation  by  amphipods.  While  the  experiment  was  rigorous,  a  close  inspec¬ 
tion  of  the  approach  suggests  the  results  were  affected  by  the  sediment-water  distribution 
chosen.  A  Cd  bioaccumulation  model  for  Macoma  balthica  (Table  14-4)  was  applied  to 
the  data  of  Kemp  and  Swartz  (1988),  who  used  Macoma  nasuta  as  an  experimental  ani¬ 
mal.  Porewater  concentrations  of  Cd  were  enhanced  by  lOOOX  to  10,000X  over  the  con¬ 
centrations  found  in  contaminated  estuarine  waters,  while  concentrations  in  food  were 
enhanced  by  16X  to  72X  compared  to  moderately  contaminated  sediments.  The  pathway 
model  predicts  the  same  conclusion  as  the  experiment;  that  is  99%  of  Cd  bioaccumula¬ 
tion  by  M.  nasuta  uptake  should  be  from  pore  waters  with  these  experimental  conditions. 
However,  it  was  because  of  the  difference  in  relative  enhancement  that  food  was  a  trivial 
source  of  bioaccumulation.  Using  conditions  typical  of  a  moderately  contaminated  estu¬ 
ary  (Table  14-4),  the  pathway  model  predicts  that  both  food  and  water  are  sources  of  Cd 
bioaccumulation.  The  experiment  does  not  refute  the  hypothesis  that  sediments  are  not 
a  direct  source  for  contaminant  bioaccumulation  under  the  conditions  that  occur  in  na¬ 
ture.  The  point  is  that  the  conditions  of  exposure  determine  the  relative  importance  of 
pathways,  and  experiments  that  do  not  mimic  conditions  typical  of  nature  {e.g.,  distribu¬ 
tions  between  dissolved  and  sedimentary  concentrations)  will  not  yield  results  that  can 
be  widely  extrapolated  to  nature.  Experiments  that  greatly  enrich  (pore)  water  concentra- 
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Table  14-4  Cadmium  bioaccumulation  via  different  pathways  by  Macoma  balthica  predicted  from  a 
pathway  model 


Experiment 

[Cdjsed 

(Pg/g) 

[Cdlsoln. 

(Pg/l) 

AE 

Css-sobi. 

(Pg/g) 

Css-food 

(pg/g) 

Css-total 

(Pg/d) 

%from 

soln. 

%from 

food 

Kemp  &  Swartz 

16 

1000 

0.1 

2000 

16 

2016 

99.2 

0.8 

Kemp  &  Swartz 

72 

5000 

0.1 

10,000 

72 

10,072 

99.3 

0.7 

Estuary;  some 

1 

0.2 

0.4 

0.4 

4 

4.4 

9 

91 

contamination 

Steady-state  bioaccumulation  (Css)  is  compared  between  the  experimental  conditions  reported  in  Kemp  and  Swartz 
(1988)  and  conditions  typical  of  an  estuary  with  moderate  Cd  contamination. 

Data  for  the  model  derived  from  experiments  reported  by  Luoma  etal  (in  prep.). 

Elimination  rate  was  0.01  d'^ ;  feeding  rate  was  0.1  g  sediment  d'^  g'^  t  issue;  assimilation  efficiencies  were  0.1  for  sediment 
and  0.4  for  sediment  with  living  diatoms;  uptake  rate  from  solution  (Css-soln.)  was  determined  experimentally  at  20%o 
salinity;  Css-soln.  and  Css-food  were  determined  from  influx  rate/elimination  rate. 


tions  of  contaminants  relative  to  other  sources  will  show  dominance  by  that  source  term, 
whatever  the  conceptual  model  of  bioaccumulation. 

The  assumption  of  pathway  equilibrium,  rather  than  pathway  additivity,  has  affected  the 
approach  of  the  studies  that  best  support  EqR  Most  such  studies  (including  the  one  dis¬ 
cussed  above)  exclude  contaminated  food  sources,  especially  the  living  component  of 
natural  sediment.  Most  sediment  collection,  handling,  or  storage  procedures  change  or 
eliminate  the  meiofauna,  diatoms,  and  bacterial  flora  of  natural  sediments.  Assimilation 
of  contaminants  from  these  important  food  sources  could  change  the  outcome  of  experi¬ 
ments. 

Correlative  studies  have  not  differentiated  which  conceptual  model  of  bioaccumulation 
best  reflects  nature.  In  nature,  bioaccumulation  could  correlate  with  dissolved  activity  of 
a  contaminant  if  all  pathways  are  in  equilibrium,  whether  pathways  are  additive  or  not. 
The  significant  correlation  between  predicted  free  Cd  ion  activities  and  Cd  residues  in 
mussel  or  amphipod  tissue  in  the  lakes  of  (Quebec  suggested  equilibrium  distributions 
controlled  bioavailability  (Tessier  et  al  1993;  Amyot  et  al  1994),  but  neither  study 
claimed  that  only  one  pathway  of  exposure  was  significant.  Empirical  correlations  could 
provide  a  basis  for  predicting  contaminant  exposures  (avoiding  the  pathway  question), 
but  first  more  study  is  necessary  of  the  questions  of  ecosystem  equilibrium  (Can  similar 
correlations  be  found  in  a  range  of  environments?)  and  species-specific  bioaccumulation 
(How  do  such  correlations  differ  among  species?). 

There  is  some  direct  experimental  evidence  that  bioaccumulation  pathways  are  additive. 
Experiments  that  compare  water  exposures  to  food  -i-  water  exposures  show  that  when 
contaminants  are  available  from  the  type  of  food  employed,  total  bioaccumulation  or 
toxicity  increases  over  that  from  water  alone;  />.,  uptake  is  additive  (Chandler  et  al  1993; 
see  metals  literature  cited  in  Luoma  1989).  Recent  experiments  that  provided  more  quan¬ 
titative  determinations  of  the  efficiencies  of  assimilation  {e.g.,  Boese  et  al  1990;  Decho 
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and  Luoma  1991;  Reinfelder  and  Fisher  1991;  Wang  and  Fisher  1996;  Wang  et  al  1996) 
may  also  help  resolve  the  contributions  of  pathway  additivity  (Thomann  1989;  Luoma  et 
al  1992;  Thomann  etai  1995;  Ylangetal  1996). 

14.4.4  Food  web  transfer:  Trophic  transfer  beyond  the  benthos? 

The  food  web  beyond  sediment-dwelling  benthos  should  be  included  if  sediment  risk 
assessments  are  to  be  relevant  to  ecosystems.  Several  factors  may  affect  the  distribution 
of  organic  chemicals  between  consumer  organisms,  their  solute  environment,  and  their 
food  sources.  Residues  in  higher  trophic-level  species  are  often  elevated  beyond  those 
predicted  from  hydrophobicity  and  simple  normalizations  (Oliver  and  Niimi  1988; 
Pereira  et  al.  1988;  Clark  et  al  1990;  LeBlanc  1995).  Organic  chemical  bioconcentration 
factor  (BCF)  (for  fish,  e.g)  differs  among  environments  where  prediction  from  laboratory 
studies  forecast  the  same  BCF  (Bremle  et  al  1995).  Exciting  advances  are  occurring  in 
studies  that  combine  ecological  principles  with  principles  that  govern  organic  chemical 
bioaccumulation.  Differences  in  organochlorine  bioaccumulation  by  predators,  for  ex¬ 
ample,  appear  to  occur  among  lakes  because  of  differences  in  food  web  structure  (Kidd 
et  al  1995).  These  results  suggest  that  the  most  accurate  food  web  exposure  assessments 
will  ultimately  come  from  models  incorporating  robust  geochemical,  biological,  and  eco¬ 
logical  concepts  relevant  to  bioaccumulation.  Most  trace  elements  do  not  biomagnify 
through  marine  food  webs  (Fisher  and  Reinfelder  1995),  but  their  trophic  transfer  is  just 
starting  to  be  studied  quantitatively  and  may  be  toxicologically  relevant  under  some  con¬ 
ditions. 

14.5  Are  alternatives  to  EqP  realistic? 

Although  some  studies  have  shown  reliable  predictions  from  simple  EqP  modeling,  the 
important  sources  of  uncertainty  discussed  above  suggest  that  such  models  will  eventu¬ 
ally  prove  inadequate  for  site-specific  exposure  assessment,  especially  for  trace  elements. 
The  best  exposure  assessments  need  to  be  geochemically  robust,  biologically  specific, 
flexible  for  a  variety  of  environmental  circumstances,  and  bioaccumulation  should  be 
quantitatively  predicted,  making  model  predictions  verifiable  in  nature.  The  ideal  model 
would  take  advantage  of  the  equilibrium-based  geochemical  theory  to  define  exposure 
and  would  employ  biologically  robust  modeling  approaches  to  predict  dose  (bioaccumu¬ 
lation). 

Models  that  incorporate  geochemical  and  biological  principles  are  available.  Multiple 
compartment  pharmacokinetic  models,  some  of  which  consider  bioenergetics  and  mul¬ 
tiple  pathways,  have  been  developed  to  predict  tissue  distributions  of  (especially  organic) 
contaminants  in  higher  order  organisms  (Norstrom  et  al  1976;  Thomann  1989;  Clark  et 
al  1988, 1990;  Landrum  et  al  1992;  McKim  and  Nichols  1994).  For  assessing  exposure 
from  sediment  to  benthos,  model  approaches  can  probably  be  simplified  to  assess  whole 
organism  exposures  (one  or  two  compartment  models)  (Pentreath  1973;  Thomann  1989; 
Boese  et  al  1990;  Clark  et  al  1990).  Such  simple  kinetic  models  were  extensively  used  in 
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radioecology  to  study  exposures,  although  multiple  pathways  were  not  always  considered 
(Ruzic  1972;  Pentreath  1973), 

One  way  to  reduce  extensive  data  requirements  of  some  of  the  above  models  is  to  incor¬ 
porate  generic  biological  and  geochemical  constants  (Thomann  1989;  Clark  etal  1990). 
Generic  choices  of  concentration  factors,  distribution  coefficients,  growth  rates,  assimi¬ 
lation  efficiencies,  and  efflux  rate  constants  have  been  used  in  model  projections.  Unfor¬ 
tunately,  all  of  these  can  vary  by  an  order  of  magnitude  or  more  in  specific  circumstances. 
Thus,  generic  models  are  instructive  with  regard  to  broad  principles  governing  contami¬ 
nant  exposure  but  may  have  large  uncertainties  when  employed  in  specific  exposure  as¬ 
sessments. 

For  site-specific  exposure  assessment,  an  alternative  is  what  we  will  call  an  empirical  ki¬ 
netic,  multiple-pathway  model.  Such  a  model  is  similar  in  principle  to  the  generic  kinetic 
models  but  is  made  more  applicable  to  specific  circumstances  by  empirically  developing 
physiological  parameters  representative  of  one  or  more  key  species  native  to  that  system 
and  employing  environmental  data  representative  of  a  range  of  conditions  in  the  system 
(Figure  14-2).  At  first  glance,  obtaining  the  data  for  such  models  appears  onerous 
(Landrum  et  al  1992).  However,  model  behavior  can  be  reasonably  constrained  to  a  re¬ 
stricted  subset  of  parameter  inputs  (McKim  and  Nichols  1994)  and  manageable  methods 
are  available  for  obtaining  species-specific  biological  data,  especially  if  whole  organism 
data  is  the  goal  (Luoma  et  al  1992;  Wang  et  al  1996). 

The  mathematics  of  the  simplest  kinetic  model  with  several  pathways  illustrates  the  nec- 


essary  experiments: 

clC„/dt  =  (If+I„)-Cke 

(14-1). 

For  If 

and 

(14-2) 

where 

^m,ss  “ 

(14-3) 

=  the  concentration  in  animal, 
t  =  time. 

If  =  the  gross  influx  rate  from  food, 

=  the  gross  influx  rate  from  water, 
kg  =  the  rate  constant  of  loss  (slowest  compartment),  and 
Cm  ss  =  the  concentration  at  steady  state. 

For  the  influx  rate  from  food. 

If  =  feed  rate  X  concentration  X  assimilation  (14-4). 

Key  parameters  to  determine  experimentally  include  influx  rates  from  solution,  influx 
rates  from  ingestion,  and  efflux  rates  (Equations  14-1, 14-2;  Figure  14-2).  If  correctly  de¬ 
termined,  rate  constants  for  these  parameters  describe  mechanisms  that  are  directly 
comparable  among  species.  Thus,  in  addition  to  their  use  in  models,  these  data  could 
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Figure  14-2  Description  of  experiments  and  field  data  necessary  to  derive  empirical  pathway  model  for 
benthic  detritus  feeder.  Approach  also  allows  quantitative  determination  of  contributions 
of  food  and  dissolved  pathways  of  bioaccumulation.  Circles  indicate  data  that  should  be 
obtained  from  site  of  interest  (data  ranges  are  preferable  to  single  values).  Squares  indicate 
experiments  or  species-specific  experimentally  derived  parameters.  Bioaccumulation  is 
calculated  from  experiments  and  field  data.  Prediction  can  be  validated  by  direct 
comparison  to  concentrations  observed  in  resident  (or  transplanted)  individuals  from  that 
species  at  site  of  interest. 

lead  to  better  appreciation  of  reasons  for  interspecies  differences  in  bioaccumulation 
and,  perhaps,  why  some  species  are  more  vulnerable  to  contaminants  than  others. 

It  is  very  important  how  the  physiological  parameters  are  determined.  Mechanistically, 
bioaccumulation  results  from  a  combination  of  gross  influx  rate  as  balanced  by  the  gross 
efflux  rate.  Gross  influx  is  a  species-specific  function  of  the  concentration  of  bioavailable 
metal.  Gross  efflux  is  an  instantaneous  function  of  the  concentration  in  tissues  and  the 
rate  constants  of  loss  (Equations  14-1, 14-2).  Typically  in  past  studies,  bio  accumulation 
was  determined  from  exposures  that  lasted  weeks  or  months.  Because  the  results  re¬ 
flected  a  mixture  of  physiological  mechanisms,  they  were  not  suitable  for  model  applica¬ 
tion.  They  could  not  be  quantitatively  extrapolated  to  nature  and  they  were  difficult  to 
compare  quantitatively  to  other  circumstances.  Some  studies  have  purported  to  achieve 
a  steady-state  bioaccumulation,  but  caution  is  warranted  in  concluding  that  fluxes  are  in 
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balance  (Luoma  1977;  Meador  etal  1995a).  For  example,  life-long  exposure  to  dissolved 
®^Zn  was  necessary  to  obtain  uniform  labeling,  and  thus  true  steady  state,  in  mosquito 
fish  (Willis  and  Jones  1977). 

Influx  fi:om  solution  can  be  determined  with  radionuclides  in  short  exposures  {e.g.,  1-d) 
because  the  goal  is  to  estimate  the  unidirectional  flux.  Influx  rates  can  be  studied  as  a 
function  of  concentration,  by  spiking  with  stable  carrier  element,  or  as  a  function  of  spe- 
ciation  by  varying  the  geochemical  characteristics  across  conditions  characteristic  of  the 
ecosystem  of  interest  (or,  alternatively,  using  water  from  the  site  of  interest).  Testing  a 
variety  of  conditions  is  quite  feasible  because  the  experiments  are  short  and  manageable. 

Influx  rates  from  ingestion  vary  with  the  food  source  and  so  are  best  determined  from  the 
product  of  assimilation  efficiencies  (from  specific  types  of  food),  feeding  rate,  and  concen¬ 
tration  (Equation  14-4).  Assimilation  efficiency  (AE)  is  the  factor  for  which  the  least  is 
known  and  is  the  key  to  understanding  contributions  of  different  pathways  of  uptake.  In 
fact,  the  final  concentrations  of  a  metal  contaminant  in  mussels,  for  example,  can  be 
shown  to  be  a  direct  function  of  that  metal’s  AE  (Wang  et  al  1996).  Repeatable  AEs  for  a 
variety  of  food  sources  and  feeding  conditions  can  readily  be  determined  with  inverte¬ 
brates  using  radionuclides  in  pulse-chase  experiments  (Decho  and  Luoma  1991, 1994; 
Fisher  et  al  1991;  Reinfelder  and  Fisher  1991;  Luoma  etal  1992;  Wang  et  al  1995;  Wang 
and  Fisher  1996).  These  studies  showed  that  assimilation  efficiencies  are  not  generic  for 
all  food  types;  a  substantial  variation  was  observed  in  AE  among  food  sources  and  feed¬ 
ing  conditions  for  nearly  every  element  as  well  as  among  elements  and  species. 

Efflux  rates  can  be  described  by  the  rate  constants  of  first-order  isotope-substitution 
kinetics  (Ruzic  1972;  Cutshall  1974).  The  loss  experiments  are  best  conducted  with  radio¬ 
nuclides  in  order  to  assure  that  no  net  influx  of  metal  occurs  during  elimination.  In  most 
invertebrates,  efflux  can  be  described  by  one  to  three  components  of  first-order  exponen¬ 
tial  decay.  Separation  of  the  compartments  is  essential  to  obtaining  repeatable  results 
(Riggs  1963).  The  rate  constants  for  each  compartment  could  vary  with  route  of  expo¬ 
sure,  but  such  variability  appears  to  be  small,  at  least  for  mussels.  A  second  parameter  of 
importance  is  the  proportion  of  metal  in  each  compartment.  This  is  affected  by  exposure 
time  (Cutshall  1974)  and  thus  can  be  experiment-specific.  Exposure  time  will  not  affect 
the  rate  constants  of  loss  if  they  are  determined  by  mathematically  stripping  compart¬ 
ments,  but  if  only  single  compartment  analysis  is  (inappropriately)  employed,  rate  con¬ 
stants  will  artificially  vary  with  exposure.  A  useful  alternative  for  predictive  pathway 
models  could  be  to  assume  that  exposures  in  nature  are  chronic,  and  thus  rate  constants 
from  the  most  slowly  exchanging  compartments  can  be  employed  as  the  efflux  term 
(Luoma  et  al  1992). 

Pathway  models  can  take  advantage  of  site-specific  environmental  data  for  assessing  ex¬ 
posures.  Reliable  methodologies  are  now  available  for  direct  determination  of  contami¬ 
nant  concentrations  in  water,  suspended  materials,  sediments  and  even  biological 
components  of  sediments,  and  in  fact,  reliable  data  already  exist  for  many  ecosystems. 
Site-specific  environmental  data  can  be  combined  with  biological  rate  constants  for  some 
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key  local  species,  to  assess  dose  (bioaccumulation)  under  a  variety  of  conditions  typical 
of  the  site.  Variable  contaminant  concentrations  can  be  included,  as  well  as  variable  DOC 
or  salinity,  changes  in  food  source,  changes  in  sediment  character,  etc.  Exposure  assess¬ 
ment  might  be  more  realistic  if  bioaccumulation  is  projected  for  the  range  of  possibilities 
typical  of  a  system,  rather  than  assuming  that  a  single  numeric  response  characterizes  an 
ecosystem. 

The  model  described  above  is  the  simplest  form  of  a  bioaccumulation  model.  It  lacks,  for 
example,  bioenergetic  terms,  or  considerations  for  seasonal  gain  and  loss  of  lipid  that 
affect  both  trace  element  and  organic  contaminant  bioaccumulation  (Cappuzzo  et  al 
1989;  Cain  and  Luoma  1990).  Even  this  simple  model  approach  appears  to  provide  rea¬ 
sonable  compatibility  with  field  observations  (Luoma  1976;  Luoma  et  al.  1992;  Wang  et 
al.  1996),  although  further  studies  undoubtedly  will  find  ways  to  improve  the  model  pre¬ 
dictions.  Data  needs  for  expanding  even  the  simple,  empirical  pathway  models  are,  at 
present,  large.  However,  as  rate  constants  are  defined  for  common  species  and  as  experi¬ 
ments  with  different  geochemical  conditions  are  related  to  these  mechanistic  biological 
responses,  adequate  data  should  become  available  for  site-specific  exposure  assessments. 

14.6  Extrapolation  from  bioavailable  dose  to  toxicity 

The  empirical  pathway  model  predicts  tissue  residues  directly.  An  advantage  of  this  ap¬ 
proach  is  that  predictions  can  be  unambiguously  compared  to  residues  of  organisms  in 
nature  {e.g.,  to  verify  the  model  approach  under  one  set  of  conditions).  However,  the  resi¬ 
due  output  of  the  pathway  model  means  that  tissue-residue-based  toxicity  extrapolations 
will  be  needed  for  risk  assessment  (Figure  14-3).  Some  data  suggest  that  tissue-residue- 
based  toxicity  predictions  may  be  reasonable  for  organic  contaminants  (Widdows  and 
Donkin  1989),  but  their  mechanistic  feasibility  has  not  been  demonstrated  for  metals. 
This  is  an  important  area  of  research  for  the  future. 

An  advantage  of  equilibrium  models  is  that  contaminant  activities  in  pore  water  can  be 
used  to  extrapolate  toxicity  from  the  large,  existing  database  on  dissolved  toxicity.  These 
data  have  proven  valuable  in  regulatory  issues  to  date,  but  they  will  be  less  useful  if  equi¬ 
librium  models  are  inadequate  estimators  of  exposure  for  risk  assessments.  Perhaps 
more  important  is  the  question  of  whether  the  toxicity  test  approach  will  be  sufficiently 
powerful  as  the  only  estimator  of  toxicity,  if  improved  accuracy  is  demanded  for  site-spe¬ 
cific  risk  assessment.  Large,  undetermined,  and  controversial  uncertainties  lie  in  extrapo¬ 
lating  from  environmental  concentrations  of  a  chemical  to  toxicity,  based  upon  such 
toxicity  tests  (Cairns  1983, 1986;  Moriarity  1988;  Depledge  1989;  Cairns  and  Mount 
1990;  lannuzzi  et  al.  1995;  Luoma  1995).  Mechanistic  reasons  exist  to  expect  that  bioas¬ 
says  can  either  overestimate  (lannuzzi  etal.  1995)  or  underestimate  (Luoma  1995)  toxic 
concentrations  of  contaminants.  The  existing  bioassay  data  are  limited  with  regard  to 
ecologically  key  species,  contaminant  activities  rather  than  concentrations,  multi-genera¬ 
tion  exposures,  compensatory  mechanisms  at  all  levels  of  organization,  interactions  with 
natural  stresses,  etc.  Linkages  between  short-term  exposures  of  organisms  via  a  single 
pathway  in  the  laboratory,  and  complex  toxicity  responses  in  nature  have  been  difficult 
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Figure  14-3  Empirical  pathway  model  generates  tissue-residue  data;  therefore,  requires  understanding 
tissue-residue-based  toxicity 


to  establish;  these  difficulties  partly  explain  why  field  validation  of  toxicity  test  results  has 
not  been  easy  to  accomplish  (Chapman  1986, 1991, 1995).  Field  validation  could  be 
easier  if  residue-based  toxicity  could  at  least  be  used  to  supplement  existing  approaches. 
Of  course,  before  any  approach  to  toxicity  estimation  is  adequate,  we  must  improve 
mechanistic  understanding  of  how  toxicity  is  manifested  in  nature  (Luoma  and  Carter 
1991). 

14.7  Conclusions 

Simple  EqP  models  sometimes  proposed  for  use  in  risk  assessment  may  not  be  suffi¬ 
ciently  flexible  to  deal  with  the  range  of  conditions  that  can  occur  in  ecosystems  (Table 
14-5).  The  greatest  uncertainties  in  assessing  contaminant  exposure  from  sediments  stem 
from  insufficient  understanding  of  the  interplay  between  geochemistry,  biology,  and 
ecology.  It  may  be  feasible  to  reduce  some  of  the  crucial  uncertainties,  however.  The  key 
processes  in  accurate  predictive  models  for  organic  contaminants  will  include  hydropho- 
bicity,  geochemical  corrections  for  factors  that  strongly  influence  bioavailability  (e.g., 
DOC),  consideration  of  the  influence  of  multiple  pathways,  kinetic  influences  such  as 
variable  contaminant  elimination  rates,  and  food  web  complexities.  For  metals,  a  better 
database  needs  to  be  developed  for  key  benthic  biota  describing  the  kinetic  components 
in  bioaccumulation  via  multiple  pathways.  These  include  element-specific  uptake  rates 
from  solution,  elimination  rates,  and  assimilation  efficiencies  from  end  members  among 
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Table  14-5  Some  areas  of  knowledge  deficit  that  cause  uncertainties  in  predictions  of  contaminant 
bioavailability  and  exposure  assessment 


Item 

Knowledge  deficit 

Applicability 

1 

Dissolved  organic  materials  in  pore  waters 

M&O 

2 

Partitioning  in  spiked  vs.  field-contaminated  samples 

0  (M=?) 

3 

Obtaining  representative  sediment  samples 

M 

4 

Relevance  of  the  sediment  sample  to  the  biological  microenvironment 

M 

5 

Role  of  the  living  organisms  within  sediments  on  exposure  and  toxicity  to 
their  consumers 

M&O 

6 

Frequency  of  nonequilibrium  conditions 

M&O 

7 

Additive  pathways  or  all  routes  equilibrated 

M&O 

8 

Efiect  of  specific  biological,  behavioral,  or  ecological  attributes  of  different 
species 

M&O 

9 

Food  web  transfer 

M&O 

10 

Speciation  and  partitioning  models  relevant  to  nature 

M 

M  =  applicable  to  trace  elements 
0  =  applicable  to  organic  contaminants 


food  sources.  Feeding  rates  and  modes  of  exposure  to  sedimentary  contaminants  need  to 
be  determined  for  key  species.  Existing  equilibrium  models  need  to  be  expanded  to  in¬ 
clude  both  AVS  and  oxic  surfaces.  End  member  effects  of  dissolved  speciation  might  be 
linked  into  bioaccumulation  predictions  via  uptake  rate  studies.  Finally,  studies  of  trace 
element  exposure  routes  for  the  upper  trophic  levels  in  the  benthic  food  web  need  to  be 
better  understood.  This  is  a  large  agenda,  but  for  the  first  time,  many  of  the  methodolo¬ 
gies  are  available  to  accomplish  much  of  it,  and  experimental  work  is  underway  to  ad¬ 
dress  at  least  some  of  the  needs. 

Perhaps  the  most  difficult  challenge  confronting  improved  risk  assessment  lies  in  linking 
bioavailability  predictions  to  toxicity,  where  the  uncertainties  of  predictions  from  the 
most  established  approach,  toxicity  tests,  are  large.  It  is  possible  that  the  simple  toxicity 
test  approach  has  reached  inherent  limits  with  substantial  uncertainties  remaining 
unresolvable.  The  tissue-residue-based  toxicity  approach  has  not  received  much  atten¬ 
tion  to  date  but  offers  at  least  the  potential  to  supplement  the  toxicity  test  approach  be¬ 
cause  of  its  less  ambiguous  linkage  to  nature  via  bioaccumulation.  It  first  needs  to  be 
better  demonstrated,  however,  that  tissue-residue-based  toxicity  estimates  are  feasible  at 
all.  Thus  while  reasonable  resolutions  to  important  bioavailabihty  questions  may  be  fast 
approaching,  the  great  debates  of  the  future  about  the  toxicity-bioavailability  linkage 
could  be  more  difficult  to  resolve.  Until  we  better  understand  the  limits  to  exposure  as- 
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sessment  imposed  by  complex  factors  affecting  bioavailability  and  the  limits  to  risk  as¬ 
sessment  imposed  by  the  toxicity-bioavailability  linkage,  science  should  remain  humble 
about  the  accuracy  of  sediment  risk  assessment. 
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15.1  Introduction 

Contaminated  sediments  are  a  major  environmental  problem  in  many  marine,  estuarine, 
and  freshwater  systems.  This  is  often  due  to  hydrophobic  contaminants  which  are  sorbed 
to  and  are  transported  by  suspended  solids.  These  contaminants  are  removed  from  the 
water  column  by  the  settling  and  deposition  of  the  suspended  solids  onto  the  bottom 
sediments.  Conversely,  the  bottom  sediments  can  serve  as  a  source  of  contaminants  to 
the  overlying  water  and  biota  because  of  the  resuspension  of  contaminated  bottom  sedi¬ 
ments  or  bioturbation  and  diffusion  together  with  the  subsequent  desorption  of  the 
sorbed  contaminants.  Since  the  total  amounts  of  contaminants  in  the  bottom  sediments 
can  be  much  greater  than  the  amounts  of  contaminants  in  the  overlying  water,  the  bot¬ 
tom  sediments  can  serve  as  a  major  source  of  contaminants  to  the  overlying  water  and 
biota  long  after  discharges  of  contaminants  into  the  system  have  ended.  These  contami¬ 
nant  source/sink  processes  are  highly  variable  in  both  space  and  time  and  cause  contami¬ 
nant  concentrations  in  the  sediment  bed  which  are  also  highly  variable  in  space  and  time. 

Investigations  of  sediment  toxicity  have  generally  been  limited  by  a  lack  of  understand¬ 
ing  of  the  factors  controlling  contaminant  availability  in  sediments.  This  availability  de¬ 
pends  on  the  transport  and  fate  of  the  contaminants  in  the  sediments  and  in  the 
overlying  water  and  can  be  significantly  affected  by  remedial  actions  and  by  natural,  large 
episodic  events.  A  knowledge  of  the  transport  and  fate  of  sediments  and  the  contami¬ 
nants  associated  with  them  is  therefore  necessary  in  order  to  quantitatively  understand, 
predict,  and  minimize  the  environmental  impact  and  risk  of  contaminated  sediments.  In 
this  chapter,  the  modeling  and  prediction  of  the  transport  and  fate  of  hydrophobic  con¬ 
taminants  is  discussed.  The  flux  of  contaminants  between  the  sediments  and  the  overly¬ 
ing  water  is  emphasized.  Our  present  understanding  of  specific  processes  that  are 
essential  to  this  modeling  is  presented  first.  These  processes  are  a)  the  hydrodynamics;  b) 
the  resuspension  and  erosion  of  bottom  sediments  and  the  dynamics  of  the  sediment 
bed;  c)  the  flocculation,  settling  speeds,  and  deposition  of  suspended  sediments;  and  d) 
the  sorption  of  hydrophobic  contaminants  to  particles.  Recent  models  of  sediment  and 
contaminant  transport  and  fate  and  applications  of  these  models  are  then  reviewed.  The 
emphasis  is  on  truly  predictive  models,  i.e.,  models  which  are  based  on  parameters  that 
can  be  and  are  determined  a  priori  from  laboratory  tests  and  simple  field  process  studies 
and  therefore  do  not  need  extensive  fine-tuning  or  calibration.  These  models  can  be  more 
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easily  extended  to  different  environmental  conditions  and  different  aquatic  systems  than 
models  significantly  dependent  on  calibration. 

As  mentioned  above,  natural  systems  are  highly  variable  in  both  space  and  time.  As  an 
example  of  temporal  variability,  the  flow  rate  in  the  Saginaw  River  from  1940  to  1990  is 
shown  in  Figure  15-1.  Since  the  average  flow  rate  is  approximately  50  mVs,  it  can  be  seen 
that  the  maximum  flows  are  almost  40  times  greater  than  the  average.  Other  rivers  have 
similar  highly  variable  flow  rates,  while  lakes  have  highly  variable  currents  and  wave  ac¬ 
tion  due  to  highly  variable  wind  velocities.  These  large  natural  variabilities,  together  with 
the  nonlinear  dependence  of  sediment  resuspension  on  the  shear  stress  due  to  currents 
and  wave  action,  lead  to  very  large  temporal  and  spatial  variabilities  in  sediment  and 
contaminant  transport  and  fate.  Because  of  this,  it  can  be  shown  that  large  events  such  as 
storms  on  lakes  and  floods  on  rivers,  despite  their  infrequent  occurrence,  are  responsible 
for  much,  if  not  most,  of  the  sediment  transport  in  rivers  and  lakes.  During  these  events, 
the  flux  of  contaminants  from  the  bottom  sediments  to  the  overlying  water  due  to  resus¬ 
pension/deposition  of  sediments  can  be  much  larger  than  the  contaminant  fluxes  due  to 
bioturbation  and  molecular  diffiision.  As  a  result,  large  flow  events  are  also  responsible 
for  most  of  the  contaminant  transport  in  surface  waters  (Lick  1992;  Lick  et  al  1994). 

In  general,  the  transport  of  sediments  and  contaminants  is  a  very  dynamic  process,  with 
the  fluxes  changing  continuously  in  magnitude  and  direction.  There  is  no  steady  state. 
Because  of  this  and  the  nonlinearity  of  the  processes  involved  in  this  transport,  an  aver¬ 
age  state  is  difficult  to  define  and  may  not  be  meaningful.  It  is  the  time-dependent  event, 
especially  the  large  runoff  and/or  storm,  that  must  be  considered  in  the  modeling  and 
prediction  of  the  transport  and  fate  of  sediments  and  contaminants.  This  is  emphasized 
in  the  following  sections. 

15.2  Hydrodynamics 

Currents  and  wave  action  are  significant  processes  when  considering  sediment  transport 
since  a)  they  are  directly  responsible  for  the  transport  of  sediments,  b)  they  cause  turbu¬ 
lence  which  disperses  sediments  both  vertically  and  horizontally,  and  c)  they  cause  a 
shear  stress  (primarily  due  to  turbulence)  at  the  sediment-water  interface  which  is  the 
primary  cause  of  the  resuspension  of  sediments. 

Extensive  work  has  been  done  and  is  continuing  on  the  dynamics  and  especially  the 
modeling  of  currents  in  rivers,  lakes,  estuaries,  and  oceans.  The  most  general  of  these 
models  are  three-dimensional  and  time-dependent  and  include  the  conservation  equa¬ 
tions  for  mass,  momentum,  and  energy  {e.g.,  Sheng  and  Lick  1979;  Blumberg  and  Mellor 
1980;  Paul  and  Lick  1985;  Mellor  1990).  These  models  have  been  extensively  applied  to 
rivers,  lakes,  estuaries,  and  oceans  when  thermal  stratification  is  not  significant  and,  to 
a  lesser  extent,  when  thermal  stratification  is  important  {e.g,  Heinrich  etal  1981, 1983). 
Most  three-dimensional  models  now  include  some  form  of  a  turbulence  closure  sub¬ 
model  (Mellor  and  Yamada  1982)  in  order  to  provide  a  more  realistic  parameterization 
of  the  vertical  mixing  processes  in  stratified  and  non-stratified  situations.  The  determina¬ 
tion  of  the  appropriate  parameters  to  use  in  these  turbulence  models,  especially  during 
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Figure  15-1  Flow  rate,  Saginaw  River  1940-1990 


big  events,  is  still  a  matter  of  concern.  Many  of  these  hydrodynamic  models  have  been 
coupled  to  some  degree  with  water  quality  models,  and  their  use  in  this  context  will  be 
referred  to  below. 

Difficulties  with  three-dimensional,  time-dependent  models  are  that  they  are  inherently 
complex,  consume  large  amounts  of  computer  time,  and  are  difficult  to  program  and  use. 
In  many  cases,  simpler  models  such  as  two-dimensional  (either  vertically  integrated  or 
horizontally  averaged),  time-dependent  models  are  more  appropriate.  These  simpler 
models  are  especially  useful  for  the  general  understanding  of  phenomena  and  the  deter¬ 
mination  of  the  effects  of  variation  in  pertinent  parameters;  in  many  important  situa¬ 
tions,  they  give  completely  adequate  and  accurate  results  for  practical  applications.  In 
particular,  during  large  storms  or  runoffs,  the  waters  in  shallow  bodies  of  water  are  very 
turbulent  and  well-mixed  vertically.  In  this  situation,  vertical  variations  of  density  and 
concentrations  of  sediments  and  contaminants  are  not  significant,  and  a  vertically  inte¬ 
grated  model  is  very  accurate.  Applications  of  these  two-dimensional,  time-dependent 
models  to  water  quality  problems  will  also  be  referred  to  below. 
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Wave  action  is  primarily  responsible  for  the  shear  stress  at  the  sediment-water  interface 
and  hence  the  resuspension  of  bottom  sediments,  and  therefore  needs  to  be  known  accu¬ 
rately.  At  present,  the  most  widely  used  procedures  for  predicting  wave  parameters  are 
semi-empirical  methods,  such  as  the  PNJ  method  developed  by  Pierson,  Neumann,  and 
James  and  the  SMB  method  developed  by  Sverdrup,  Munk,  and  Bretschneider.  These 
procedures  give  the  significant  wave  height  and  significant  wave  period  as  a  function  of 
wind  speed,  fetch,  and  mean  depth.  From  these  relations  and  by  assuming  inviscid  flow, 
one  can  determine  the  horizontal  periodic  flow  at  the  sediment-water  interface.  Near 
this  interface,  a  turbulent  boundary  layer  exists  (on  the  order  of  10  cm  thick  for  typical 
surface  waves)  and  produces  a  shear  stress  at  the  interface.  Kajiura  (1968)  has  analyzed 
this  problem  based  on  the  assumption  of  a  time-independent  but  spatially  varying  verti¬ 
cal  eddy  viscosity.  Recent  work  has  extended  the  analysis  to  the  case  when  waves  and 
currents  are  present  simultaneously  (Grant  and  Madsen  1979;  Christoffersen  and 
Jonsson  1985). 

15,3  Resuspension,  erosion,  and  sediment  bed  dynamics 

In  order  to  predict  the  transport  and  fate  of  any  hydrophobic  contaminant,  it  is  essential 
to  understand  the  resuspension  and  transport  of  bottom  sediments.  The  resuspension  of 
bottom  sediments  is  quite  complex  with  the  resuspension  rate  of  bottom  sediments 
strongly  and  critically  dependent  on  the  applied  shear  stress  due  to  wave  action  and  cur¬ 
rents,  horizontal  location,  depth  below  the  sediment-water  interface,  and  consolidation 
time.  The  types  of  sediment  can  change  rapidly  from  coarse  sands  (in  shallow,  near-shore 
areas  or  where  currents  are  strong)  to  fine-grained  muds  (in  deeper  areas  where  currents 
and  wave  action  are  small).  Sediment  type  can  also  change  vertically  in  the  bottom  sedi¬ 
ments,  as  for  example,  when  there  is  a  layer  of  sand  in  between  layers  of  fine-grained 
muds.  Changes  in  sediment  type  can  lead  to  changes  in  resuspension  rates  by  several 
orders  of  magnitude.  These  rates  can  also  change  with  time  and  distance  from  the  sedi¬ 
ment  water  interface  as  suspended  sediments  are  deposited  over  previously  deposited 
sediments  which  are  more  dense  and  more  compacted.  For  these  reasons,  the  dynamics 
of  bottom  sediments  including  resuspension,  burial  by  suspended  sediments  depositing 
on  the  surface  of  bottom  sediments,  and  the  consolidation  and  dewatering  of  bottom 
sediments  after  deposition  are  all  essential  to  understand. 

Considerable  work  has  been  done  on  the  transport  of  coarse-grained  sediments  by  both 
bed  load  and  suspended  load.  For  summaries  of  this  work,  see  Dyer  (1986)  and  van  Rijn 
(1984a,  1984b).  Many  formulas  have  been  derived,  but  relatively  little  verification  of 
these  formulas  has  been  made. 

More  recently,  the  emphasis  has  been  on  understanding  and  predicting  the  resuspension 
and  deposition  of  fine-grained  sediments.  Laboratory  experiments  on  reconstructed  sedi¬ 
ments  (e.g.,  see  Partheniades  1972;  Mehta  1973;  Fukuda  and  Lick  1980;  MacIntyre  etal. 
1990)  have  determined  the  dependence  of  resuspension  on  various  governing  parameters 
such  as  the  applied  shear  stress,  water  content  or  time  after  deposition,  particle  size  dis¬ 
tribution,  mineralogy,  and  numbers  and  types  of  benthic  organisms.  These  experiments 
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are  usually  done  in  annular  flumes,  although  other  devices  are  also  possible.  However,  by 
their  very  nature,  laboratory  experiments  deal  with  disturbed  sediments.  For  an  adequate 
understanding  of  sediment  and  contaminant  transport,  the  resuspension  properties  of 
undisturbed  sediments,  as  they  are  in  the  field,  need  to  be  known.  For  this  purpose,  a 
portable  resuspension  device  (called  a  shaker)  has  been  devised  and  applied  (Tsai  and 
Lick  1986, 1987;  Ueketal  1995). 

Many  contaminants  are  buried  at  depths  of  up  to  several  meters  in  the  bottom  sediments 
of  rivers,  lakes,  and  estuaries.  A  major  question  is  whether  these  buried  sediments  and 
the  contaminants  associated  with  them  can  be  exposed  and  eroded  during  large  floods 
and  storms.  In  order  to  quantitatively  understand  and  predict  sediment  and  contami¬ 
nant  transport  and  fate  during  these  large  events,  it  is  necessary  to  be  able  to  measure  the 
erosion  properties  of  sediments  at  high  shear  stresses,  on  the  order  of  100  dynes/cm^ 
and  with  depth,  down  to  a  meter  or  more.  However,  both  the  annular  flume  and  shaker 
are  limited  to  shear  stresses  below  about  10  dynes/cm^.  Because  of  this,  both  of  these 
devices  are  only  capable  of  resuspending  the  surficial  layers  of  the  sediments,  usually  only 
the  top  few  millimeters. 

In  order  to  determine  the  erosion  properties  of  relatively  undisturbed  sediments  at  high 
shear  stresses  and  with  depth,  a  unique  flume  (Sedflume)  has  been  devised,  developed, 
and  tested  (McNeil  et  al  1995).  This  is  shown  in  Figure  15-2  and  is  such  that  box  cores 
can  be  inserted  into  the  bottom  of  the  test  section  of  the  flume.  By  extrusion,  the  sedi¬ 
ments  in  these  cores  can  then  be  continuously  exposed  to  a  shear  flow.  Measurements  at 
very  high  shear  stresses,  on  the  order  of  100  dynes/cm^  are  possible.  Tests  have  been 
made  on  reconstructed  sediments  in  the  laboratory  and  on  cores  of  undisturbed  bottom 
sediments  from  the  Detroit  River  in  Michigan  and  the  Fox  River  in  Wisconsin.  Erosion 
rates  of  sediments  from  different  locations  at  low  to  high  shear  stresses  and  as  a  function 
of  depth  down  to  1  and  2  m  have  been  measured.  Examples  of  test  results  for  cores  from 
the  Detroit  River  are  shown  in  Figures  15-3a  and  15-3b.  Figure  15-3a  shows  the  erosion 
rate  as  a  function  of  depth  for  different  shear  stresses  for  a  core  consisting  of  a  dark  gray 
silt  near  the  surface  changing  gradually  to  a  silt  and  clay  sediment  further  down  in  the 
core.  Since  the  type  of  sediment  changes  relatively  little  with  depth,  the  erosion  rates  vary 
relatively  smoothly  with  depth.  Figure  15-3b  shows  the  erosion  rates  as  a  function  of 
depth  for  a  core  which  is  highly  stratified.  A  surface  layer  consists  of  coarse  sands  and 
other  debris.  Below  this  is  a  very  firm  layer  of  peat,  followed  by  a  layer  of  peat  mixed  with 
silt.  The  core  ends  in  a  firm  clay  layer.  The  peat  layer  and  the  clay  layer  are  extremely  dif¬ 
ficult  to  erode,  while  the  coarse  surface  layer  and  the  layer  of  peat  and  silt  are  relatively 
easy  to  erode. 

These  are  just  two  examples  of  cores  already  taken  and  tested.  In  general,  it  can  be  shown 
that  the  shear  needed  to  erode  sediments  varies  by  more  than  an  order  of  magnitude  as 
the  depth  in  the  sediment  increases  and  at  different  locations  in  these  rivers.  From  these 
measurements,  a  reasonably  accurate  description  of  the  resuspension  properties  of  un¬ 
disturbed  sediments  in  these  two  rivers  is  being  obtained. 
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TOP  VIEW 

Ozz 

PUMP 

SIDE  VIEW 


Figure  15-2  Essentially  straight  sedflume  with  rectangular  cross-section  test  section  and  open  bottom 
through  which  rectangular  cross-section  coring  tube  containing  sediment  can  be  inserted 
(By  extrusion,  sediments  in  these  cores  can  then  be  continuously  exposed  to  shear  flow.) 


15.4  Flocculation,  settling  speeds,  and  deposition 

Fine-grained  sediments  and  particles  always  flocculate  to  some  extent.  This  flocculation 
affects  the  transport  of  the  particles  due  to  its  effect  on  settling  speeds  and  also  affects  the 
adsorption-desorption  process.  Extensive  work  has  been  done  on  the  aggregation  and 
disaggregation  of  sediments  where  the  basic  particle  sizes  are  from  1  |Jm  to  10  jlm.  From 
these  basic  particles,  floes  can  form  which  are  up  to  several  centimeters  in  size  and  which 
contain  10®  to  10®  basic  particles. 

In  experiments  using  Couette  and  diskflocculators,  the  time-dependent  behavior  of  these 
floes  has  been  investigated.  In  particular,  the  effects  of  fluid  shear,  sediment  concentra¬ 
tion,  salinity  (ionic  strength),  and  organic  matter  on  the  aggregation  and  disaggregation 
rates;  the  steady-state  equilibrium  sizes;  and  the  settling  speeds  of  the  floes  have  been 
determined  (Tsai  and  Lick  1987;  Burban  et  al  1989, 1990;  Lick  et  al  1993).  Figure  15-4 
shows  some  experimental  results  on  the  median  diameter  of  floes  as  a  function  of  time  in 
the  case  when  differential  settling  is  the  dominant  mechanism  for  flocculation.  Initially, 
the  floes  are  disaggregated  into  the  basic  particles,  which  are  about  6  pm  in  diameter. 
Due  to  differential  settling,  these  particles  then  flocculate  with  time  as  shown  and  even- 
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Figure  15-3a  Erosion  rates  as  function  of  depth  with  shear  stress  as  parameter  for  sediment  core  from 
Detroit  River:  sediment  silts  and  clays  change  relatively  little  with  depth 
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Figure  15-3b  Erosion  rates  as  function  of  depth  with  shear  stress  as  parameter  for  sediment  core  from 
Detroit  River:  highly  stratified  sediments 
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tually  reach  a  steady  state.  In  general,  it  can  be  shown  that  the  diameter  of  the  steady- 
state  floes  and  the  time  to  steady  state  increase  as  the  sediment  concentration,  fluid  shear, 
and  salinity  decrease.  Theoretical  analyses  of  the  experimental  results  have  also  been 
made  (Lick  and  Lick  1988;  Lick  et  al.  1993).  The  analyses  are  based  on  a  general  formula 
for  the  time  rate  of  change  of  the  particle  size  distribution,  which  includes  aggregation 
due  to  collisions  and  disaggregation  due  to  fluid  shear  and  collisions.  The  theory  can 
accurately  reproduce  the  experimental  results  of  the  median  diameter  of  the  floes  as  a 
function  of  time  and  also  the  steady-state  floe  size  distribution.  The  general  theory  has 
also  been  simplified  so  as  to  make  it  much  more  numerically  efficient  (Lick  etal  1992). 
From  these  studies,  a  fairly  complete  description  of  flocculation  for  a  wide  and  realistic 
range  of  parameters  (fluid  shear,  sediment  concentration,  salinity,  concentration  of  or¬ 
ganic  matter)  is  now  available. 

Although  a  knowledge  of  settling  speeds  is  essential  in  predicting  transport  in  the  over- 
lying  water,  the  deposition  of  particles  on  the  sediment  bed  is  not  only  determined 
uniquely  by  the  settling  speed  but  also  depends  on  the  shear  stress  (turbulence)  at  the 
sediment-water  interface,  i.e.,  only  large,  dense  particles  and  floes  settle  out  in  a  highly 
turbulent  flow  while  almost  all  particles,  except  the  finest,  settle  out  in  an  almost  quies¬ 
cent  flow.  Little  quantitative  information  is  available  on  this  dependence  of  deposition 
rate  on  size  and  density  of  the  particles  and  the  turbulence  of  the  flow. 

15.5  Sorption  of  hydrophobic  organic  chemicals  to  particles 

In  most  previous  work  on  the  sorption  of  hydrophobic  organic  chemicals  to  particles,  it 
has  been  assumed  that  chemical  equilibrium  exists  and  that  this  equilibrium  can  be 
quantified  by  means  of  a  partition  coefficient,  K^.  However,  in  recent  work,  it  has  been 
noted  that  although  the  initial  adsorption  or  desorption  of  a  hydrophobic  chemical  can 
be  quite  rapid,  with  time  scales  of  minutes  to  hours,  the  final  equilibration  may  take  days 
to  months  or  even  longer.  A  confusing  factor  in  most  experiments  is  the  observation  that 
the  rates  of  both  adsorption  and  desorption  seem  to  depend  on  the  concentration  of  sus¬ 
pended  solids.  This  is  called  the  solids  concentration  effect. 

Considerable  experimental  work  has  been  done  on  the  sorption  and  partitioning  of  or¬ 
ganic  chemicals  to  sediments  and  saturated  soils.  In  almost  all  cases,  the  experiments 
were  short-term,  hours  to  a  few  days.  An  extensive  review  of  the  work  on  sediments  has 
been  given  by  Di  Toro  et  al.  (1991),  while  work  on  soils  has  been  recently  summarized  in 
two  conference  proceedings  (Sawhney  and  Brown  1989;  Baker  1991).  Katickhoff  and 
Morris  (1985)  present  and  discuss  the  results  of  long-term  purge  experiments  on  the 
sorption  dynamics  of  several  different  hydrophobic  pollutants  in  several  different  sedi¬ 
ment  suspensions.  They  demonstrated  that,  for  many  organic  chemicals,  sorption  is  a 
slow  process  and  equilibrium  might  take  days  to  months  to  achieve.  In  their  experiments, 
a  solids  concentration  effect  was  present  and  seemed  to  be  greater  for  cohesive  sediments 
than  for  non-cohesive  sediments.  Coates  and  Elzermann  (1986)  studied  the  desorption 
of  PCBs  from  sediments.  Their  data  indicated  that  equilibration  times  for  PCBs  in  sedi- 
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Figure  15-4  Median  floe  diameter  as  function  of  time  with  sediment  concentration  as  a  parameter 
(flocculation  due  to  differential  settling  of  particles) 


ments  could  be  quite  long,  probably  on  the  order  of  six  weeks  for  PCBs  with  low  chlorine 
content  and  months  to  years  for  PCBs  with  a  significantly  higher  chlorine  content. 
Gschwend  and  Wu  (Gschwend  and  Wu  1985;  Wu  and  Gschwend  1986)  in  their  sorption 
studies  have  shown  that,  for  many  experiments,  the  solids  concentration  effect  could  be 
attributed  to  analytical  artifacts  caused  by  incomplete  phase  separations.  Their  experi¬ 
ments  were  short-term  (1  to  3  days).  Jepsen  et  al  (1995)  have  investigated  the  time- 
dependent  sorption  and  partitioning  of  hexachlorobenzene  to  natural  sediments  by 
means  of  batch  mixing  experiments.  Experiments  were  conducted  for  long  times  (2  to  5 
months)  in  order  to  reach  sorption  equihbrium  and  at  different  sediment  concentrations 
(10, 100,  500,  2000,  and  10,000  mg/L).  Experiments  quantitatively  demonstrated  the 
time-dependent  effects  on  sorption  of  hexachlorobenzene  dissolution,  of  the  flocculation 
of  colloidal  matter  in  the  water,  and  of  the  flocculation  of  the  suspended  sediments.  Erom 
the  data,  a  true  equilibrium  partition  coefficient  (independent  of  solids  concentration)  as 
well  as  the  approximate  times  to  equilibrium  were  determined.  A  model  for  the  sorption 
of  hydrophobic  chemicals  to  suspended  sediments  has  recently  been  developed  (Lick  and 
Rapaka  1996)  which  quantitatively  describes  both  adsorption  and  desorption  processes 
and  gives  good  agreement  with  experimental  results. 
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15.6  Models  of  sediment  and  contaminant  transport  and  fate 

Many  models  of  sediment  transport  have  been  developed.  These  models  have  been 
mainly  concerned  with  the  description  of  the  suspended  solids  concentration,  and  little 
effort  has  been  made  to  accurately  describe  the  sediment  bed  and  its  properties,  espe¬ 
cially  as  the  bed  changes  due  to  resuspension,  deposition,  and  compaction.  None  of  the 
models  have  included  accurate  descriptions  of  all  the  significant  processes,  especially 
flocculation.  However,  through  experimental  and  field  work,  the  processes  governing 
sediment  resuspension,  transport,  and  fate  have  come  to  be  better  understood  and  quan¬ 
tified  and  are  now  being  incorporated  into  more  accurate  and  more  general  sediment 
transport  models.  A  recent  calculation  is  for  the  Fox  River  (Gailani  et  al  1991).  In  this 
calculation,  the  authors  used  laboratory  and  field  measurements  to  determine  the  resus¬ 
pension  parameters;  a  quasi-equilibrium  model  of  flocculation  for  simplicity;  settling 
speeds  as  measured  in  the  laboratory;  a  two-dimensional  (vertically  integrated),  time- 
dependent,  hydrodynamic  and  sediment  transport  model;  an  SMB  model  of  wave  action 
when  waves  were  significant;  a  three-dimensional,  time-dependent  sediment  bed  model 
with  properties  based  on  experimental  work;  suspended  solids  concentrations  measured 
at  the  DePere  Dam  (the  upstream  boundary)  as  input;  and  suspended  solids  concentra¬ 
tions  at  the  river  mouth  at  Green  Bay  (the  downstream  boundary)  as  verification.  Calcu¬ 
lations  of  suspended  solids  concentrations  as  a  function  of  time  were  made  for  three 
major  runoff  events  and  for  steady  flows  at  high,  medium,  and  low  flow  rates.  For  the 
major  runoff  events,  excellent  agreement  was  obtained  between  the  calculated  and  ob¬ 
served  suspended  solids  concentrations.  No  data  were  available  to  verify  changes  in 
bathymetry  due  to  resuspension  and  deposition. 

However,  changes  in  bathymetry  over  a  6-month  period  were  measured  for  the  Saginaw 
River  by  the  USAGE.  For  this  same  period,  essentially  the  same  model  as  described  above 
(but  extended  to  include  curvilinear  coordinates  and  bed  load)  was  used  to  calculate  sedi¬ 
ment  transport  in  the  Saginaw  including  changes  in  bathymetry  due  to  resuspension, 
deposition,  and  bed  load  (Cardenas  et  al.  1995).  Good  agreement  between  the  observa¬ 
tions  and  calculations  was  obtained.  An  example  of  a  predictive  calculation  is  given  in 
Figure  15-5.  This  shows  the  predicted  changes  in  resuspension  and  deposition  for  a  25- 
year  period.  Erosion  occurs  in  the  channel  while  deposition  occurs  in  the  shallow  near¬ 
shore  areas. 

More  recently,  these  calculations  have  been  extended  to  include  an  investigation  of  the 
transport  and  fate  of  PCBs  in  the  Saginaw  River.  The  emphasis  was  on  the  effects  of  large 
flow  events,  incoming  upstream  PCB  loads,  and  burial  of  contaminated  sediments  by 
clean  sediments  with  subsequent  erosion  of  sediments  by  a  large  flow  event.  In  previous 
modeling  of  contaminant  transport  and  fate,  the  dynamics  of  the  sediment  bed  were 
such  that  the  sediments  and  contaminants  in  the  bed  were  generally  well  mixed  verti¬ 
cally.  This  mixing  was  primarily  due  to  the  numerical  procedures  and  approximations 
used.  Because  of  this  mixing,  depositing  contaminated  sediments  were  mixed  with  clean 
sediments  and  hence  diluted.  These  sediments,  when  subsequently  resuspended,  would 
then  have  a  lower  contaminant  concentration  and  a  lesser  effect  on  water  quality  than  if 
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Figure  15-5  Resuspension  and  deposition:  25-yr  simulation.  Shaded  area  is  erosional;  unshaded  areas  are 
depositional  Resuspension  contours  from  0  g/cm^  to  45  in  15  g/cm^  intervals.  Deposition 
contours  from  20  g/cm^  to  100  g/cm^  in  40  g/cm'^  intervals.  Dashed  line  is  zero  erosion/deposition. 


they  had  not  been  diluted.  The  extent  of  this  dilution  depends  on  the  thickness  of  the 
sediment  bed,  a  parameter  which  was  generally  somewhat  arbitrarily  specified.  In  the 
absence  of  benthic  organisms,  this  mixing  should  not  occur.  Even  when  organisms  are 
present  and  active,  this  mixing  should  not  occur  to  a  depth  of  more  than  a  few  centime¬ 
ters,  possibly  as  much  as  10  cm.  In  the  latest  calculations,  discrete  layers  of  sediment  (and 
the  contaminants  associated  with  them)  can  be  resuspended  and/or  deposited  without 
numerical  mixing.  This  capability  is  essential  in  accurately  predicting  the  long-term 
transport  and  fate  of  contaminants  and  nutrients  and  hence  the  water  quality  of  rivers 
and  lakes  over  the  long  term.  Another  interesting  calculation  is  that  of  the  resuspension 
and  transport  of  sediments  in  Lake  Erie  (Lick  et  al  1994).  The  emphasis  in  this  article  was 
on  the  effects  of  major  storms.  Calculations  were  made  for  different  constant  wind  speeds 
and  wind  directions  and  also  for  the  November  1940  storm  (see  Figure  15-6),  one  of  the 
largest  in  the  last  century.  It  can  be  seen  that,  for  this  storm,  up  to  a  meter  or  more  of 
sediment  can  be  eroded  from  near-shore  areas,  while  up  to  20  cm  of  sediment  can  be 
deposited  in  other  areas  further  off  shore.  In  general,  the  numerical  results  indicate  that 
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major  storms,  despite  their  infrequent  occurrence,  are  responsible  for  most  of  the  resus¬ 
pension  and  transport  of  sediments  in  Lake  Erie.  For  purposes  of  verification,  the  results 
of  numerical  calculations  were  compared  with  and  ^^Ts  data  from  a  sediment  core 

obtained  by  Robbins  et  al  (1978)  at  an  Eastern  Basin  location.  The  data  indicates  that 
deposition  at  this  site  was  very  nonuniform  with  time,  with  infrequent  large  depositions 
caused  by  major  storms  which  were  separated  by  long  periods  of  time  in  which  very  little 
deposition  occurred.  The  results  of  the  calculated  deposition  are  consistent  with  this 
idea. 

Sediment-water  fluxes  occur  primarily  by  a  combination  of  three  processes:  resuspen¬ 
sion/deposition,  bioturbation,  and  diffusion.  Each  of  these  processes  is  distinctly  differ¬ 
ent  from  the  others.  In  general,  these  processes  occur  simultaneously  and  there  are 
interactions  among  them.  However,  in  many  realistic  situations,  one  of  the  processes  is 
dominant  over  the  others  and  so,  to  a  good  approximation,  can  be  considered  as  inde¬ 
pendent.  All  of  these  processes  are  affected  by  the  adsorption-desorption  process  for 
hydrophobic  chemicals.  Contaminant  fluxes  due  to  diffusion  are  generally  small  com¬ 
pared  to  fluxes  due  to  resuspension/deposition  and  bioturbation.  Bioturbation  has  often 
been  assumed  to  be  the  dominant  cause  for  contaminant  fluxes  from  the  bottom  sedi¬ 
ments.  The  justification  for  this  has  been  the  surficial  layers  in  some  areas  of  lakes  which 
have  fairly  uniform  composition.  This  uniform  composition  is  deduced  from  radiomet¬ 
ric  dating  and  is  attributed  to  benthic  organisms,  which  presumably  mix  these  layers  by 
furrowing  and  burrowing.  From  a  consideration  of  sediment  dynamics,  an  alternative 
and  more  plausible  explanation  for  this  in  many  instances  is  that  this  layer  of  uniform 
properties  is  due  to  episodic  resuspension/deposition  events  (Lick  1992). 

15.7  Conclusions 

Contaminated  sediments  can  cause  significant  environmental  problems  in  marine,  estua¬ 
rine,  and  freshwater  systems.  In  order  to  quantitatively  understand,  predict,  and  mini¬ 
mize  the  environmental  impact  and  risk  of  these  sediments,  a  knowledge  of  the  transport 
and  fate  of  the  sediments  and  the  contaminants  associated  with  them  is  necessary.  In  this 
chapter,  the  modeling  of  this  transport  and  fate  and  the  processes  that  are  essential  to 
this  modeling  were  discussed.  These  processes  are  a)  the  hydrodynamics;  b)  the  resus¬ 
pension  and  erosion  of  bottom  sediments  and  the  dynamics  of  the  sediment  bed;  c)  the 
flocculation,  settling  speeds,  and  deposition  of  suspended  sediments;  and  d)  the  sorption 
of  hydrophobic  contaminants  to  particles.  A  brief  review  of  recent  models  of  sediment 
and  contaminant  transport  and  fate  was  then  given.  From  results  of  calculations  and 
observations,  it  can  be  shown  that  the  sediment-water  fluxes  of  contaminants  and  the 
contaminant  concentrations  in  the  sediment  bed  are  highly  variable  in  both  space  and 
time.  This  variability  must  be  taken  into  account  in  modeling  and  in  assessing  the  expo¬ 
sure  of  organisms  to  contaminants.  Because  large  flow  events  are  responsible  for  most  of 
the  sediment  and  contaminant  transport  in  surface  waters,  these  large  flow  events  must 
be  considered  in  detail  in  the  modeling. 
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Figure  15-6  Net  changes  in  sediment  bed  thickness  in  Lake  Erie  during  major  storms,  November  1940 
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Sediments  as  complex  mixtures: 
an  overview  of  methods  to 
assess  ecotoxicological  significance 

Richard  C  Swartz,  Dominic  M.  Di  Toro 

16.1  The  reality  of  mixtures  of  sediment  contaminants 

Sediments  from  many  urbanized  embayments  contain  hundreds  of  individual  contami¬ 
nants.  The  limit  to  the  number  of  chemicals  that  can  be  detected  in  a  sediment  sample  is 
essentially  determined  by  the  accuracy  and  precision  of  analytical  methods.  Risebrough 
(1994)  routinely  quantified  93  organic  chemicals  in  a  sediment  survey  in  San  Francisco 
Bay.  Over  100  chlorinated  hydrocarbon  compounds  alone  were  measured  in  some  sedi¬ 
ment  samples  from  Puget  Sound,  Washington  (Malins  et  al  1980).  Thus,  the  term  com¬ 
plex  mixture  is  especially  applicable  to  sediment  contaminants.  The  complexity  of  the 
mixture  may  never  be  completely  understood  because  of  limited  funds  and  analytical 
methods.  However,  the  reality  of  the  mixture  and  its  ecotoxicological  significance  must 
always  be  recognized.  The  obvious  message  is  that  multiple  sediment  contaminants  have 
a  potential  for  combined  effects  that  might  not  be  predicted  by  a  risk  assessment  based 
on  individual  chemicals. 

Two  aspects  of  sediment  contaminant  mixtures  that  warrant  special  attention  are  co¬ 
occurrence  and  covariance.  Certain  compounds  tend  to  have  universal  co-occurrence 
because  of  a  common  source  or  chemical  relation,  e.g.,  DDT  and  its  metabolites,  PCB  iso¬ 
mers,  higher  molecular  weight  PAHs.  Co-occurrence  of  chemicals  in  a  given  study  area 
often  exists  among  diverse  chemicals  because  of  common,  site-specific  sources  of  con¬ 
tamination,  e.g.,  a  sewage  or  industrial  outfall.  Chemicals  with  common  sources  tend  to 
show  spatial  and  temporal  covariance  in  their  distribution  patterns  {e.g.,  the  distribution 
of  DDT  and  dieldrin  in  the  Lauritzen  Channel  off  Richmond  Harbor  CA,  Figure  16-1; 
metals  on  the  Palos  Verdes  Shelf  CA,  Figure  16-2).  In  depositional  environments,  chemi¬ 
cals  may  covary  in  sediment  depth  profiles  {e.g.,  the  distribution  of  Cd,  OC,  DDE,  and 
toxicity  in  sediment  profiles  from  the  Palos  Verdes  Shelf  CA,  Figure  16-3). 

Co-occurrence  and  covariance  of  sediment  contaminants  often  result  in  significant  cor¬ 
relations  of  distribution  among  chemical,  biological,  and  toxicological  parameters 
(Ferraro  et  al.  1991;  Canfield  et  al.  1996).  These  correlations  do  not  signify  causal  rela¬ 
tions.  For  example,  the  correlation  of  DDT  and  dieldrin  with  toxicity  in  the  Lauritzen 
Channel  does  not  provide  sufficient  evidence,  by  itself,  that  either  chemical  contributed 
to  the  observed  toxicity  (Swartz  etal.  1994].  The  distinction  between  effects  and  exposure 
is  inherent  in  the  fundamental  risk  assessment  paradigm.  Correlation  of  chemical  con- 
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Figure  16-1  Covariance  of  XDDT  and  dieldrin  in  the  Lauritzen  Channel  and  Richmond  Inner  Harbor 
CA  (after  Swartz  et  al.  1994) 


•  Cr  O  Pb  ■  Ni  □  Cd 

Figure  16-2  Spatial  and  temporal  changes  in  metals  concentrations  in  sediment  on  the  Palos  Verdes 
Shelf  CA  (after  Ferraro  et  al.  1991) 
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Figure  16-3  Vertical  distribution  of  percent  mortality  of  Rhepoxynius  abronius,  TOC,  Cd,  and  p,  p'-DDE 
in  sediment  cores  from  the  Palos  Verdes  Shelf  CA  (after  Swartz  etal.  1991) 


centration  and  biological  response  only  establishes  potential  exposure.  The  effects  assess¬ 
ment  must  be  based  on  independent  evaluation  of  causality. 

16.2  Risk  assessment  based  on  toxic  unit  hypothesis 

The  simplest  way  to  compare  the  hazard  of  individual  chemicals  in  sediment  contami¬ 
nant  mixtures  is  to  normalize  concentrations  to  an  independently  derived  effects  concen¬ 
tration  or  benchmark.  The  benchmark  might  be  an  S(^G,  an  NOEC,  or  an  LC50.  In  the 
latter  case,  the  normalized  concentrations  are  expressed  as  TUs:  number  of  TU  =  ambi¬ 
ent  concentration/LC50  (Sprague  1970).  This  approach  allows  comparison  of  the  relative 
potential  contribution  of  individual  chemicals  to  observed  or  predicted  effects.  It  does 
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not,  by  itself,  indicate  the  cumulative  effect  of  the  combination  of  chemicals  in  the  mix¬ 
ture. 

The  TU  concentration  of  sediment  contaminants  can  be  expressed  as  either  porewater 
TUs  {e.g.,  Kemble  et  al.  1994;  Pesch  et  al.  1995)  or  OC-normalized  sediment  TUs  {e.g, 
Swartz  et  al  1994).  The  latter  approach  was  used  to  assess  the  relative  contribution  of 
PAHs,  PCBs,  dieldrin,  DDT,  and  metals  in  the  Lauritzen  and  Santa  Fe  Channels,  CA.  Sedi¬ 
ments  in  both  Channels  were  acutely  toxic  to  marine  amphipods  (Pinza  etal  1992;  White 
et  al  1993;  Swartz  et  al  1994).  The  mean  sediment  concentration  of  PAHs,  PCBs,  dield¬ 
rin,  and  metals  in  the  Lauritzen  Channel  was  <  0.28  TU  (Table  16-1).  DDT  and  its  me¬ 
tabolites  had  a  mean  concentration  of  5.3  TU.  On  this  basis,  DDT  was  identified  as  the 
dominant  ecotoxicological  factor  (Swartz  et  al  1994).  Although  dieldrin  and  DDT  were 
highly  correlated  with  each  other  (Figure  16-1)  and  with  toxicity,  a  comparison  of  their 
relative  hazard  in  the  Lauritzen  Channel  was  made  possible  by  the  TU  analysis.  Similarly, 
PAH  compounds  (TU  =  1.7)  were  identified  as  the  major  source  of  toxicity  in  the  Santa  Fe 
Channel  because  the  TUs  for  SDDT,  dieldrin,  PCB,  and  metals  were  <  0.24  TU  (Table 
16-1). 


16.3  Metal  mixtures:  (SEM  -  AVS) 

Acid  volatile  sulfide  is  the  dominant  binding  phase  for  some  divalent  metals  (Cd,  Cu,  Ni, 
Pb,  Zn)  in  anaerobic  sediment  (Di  Toro  et  al  1990).  Acid  volatile  sulfide  is  composed 
principally  of  solid  phase,  iron  monosulfides  (FeS(s))  that  are  soluble  in  cold  acid  and 
thus  extracted  as  AVS.  FeS(s)  is  in  equilibrium  with  aqueous  phase  sulfide  by  the  reaction 

FeS(s)oFe2+  +  s2-. 

Certain  metals  (Cu,  Zn,  Pb,  Ni,  Cd)  react  with  solid  phase  AVS  to  form  metal  sulfide  pre¬ 
cipitates  that  are  very  insoluble  in  pore  water: 

Me2+  +  FeS(s)  ^  MeS(s)  +  Fe^^. 

Thus,  the  iron  in  FeS(s)  is  displaced  by  metal  to  form  soluble  iron  and  solid  metal  sulfide. 
This  displacement  occurs  because  the  metal  sulfide  solubility  parameters  of  CuS,  ZnS, 
PbS,  NiS,  and  CdS  are  all  less  than  that  of  FeS  (Di  Toro  et  al  1990).  As  long  as  the  molar 
concentration  of  AVS  in  sediment  exceeds  the  combined  molar  concentration  of  Cu,  Zn, 
Pb,  Ni,  and  Cd,  these  five  metals  will  form  solid  sulfides  and  have  very  low  porewater 
concentrations. 

The  appropriate  quantification  of  the  combined  molar  concentration  of  these  five  metals 
is  termed  simultaneously  extracted  metal  (SEM),  le.,  the  metal  which  is  extracted  in  the 
cold  acid  used  in  the  AVS  procedure  (Di  Toro  et  al  1990).  Thus,  when  SEM  -  AVS  <  0, 
porewater  concentrations  of  all  five  metals  will  be  quite  low  because  of  the  formation  and 
low  solubility  of  their  metal  sulfides. 

Sediment  toxicity  can  be  predicted  from  porewater  concentrations  of  metals  (Swartz  et  al 
1986,  Kemp  and  Swartz  1988).  Extensive  research  has  shown  that  when  SEM  -  AVS  <  0 
(or  SEM/AVS  <  1),  porewater  concentrations  of  metals  are  too  low  to  cause  toxicity  to 
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Table  16-1  Toxic  unit  concentrations  of  organic  carbon-normalized  sediment  contaminants  in 
Lauritzen  and  Santa  Fe  Channels 


Contaminant 

TU  value,  LC5Q  =  1.0  TU  (Reference) 

Mean  TU  in 
Lauritzen  Channel 

TU  in  Santa  Fe 
Channel 

iDDTi 

371  //g/g  OC  (Nebeker  et  al.  1989) 

5.3 

0.24 

Dieldrin 

1,955 /ig/g  OC  (Hoke  and  Ankley  1991) 

0.012 

0.001 

PCB2 

2,600 //g/g  OC  (Swartz  et  al  1988) 

0.0026 

0.008 

IPAH^ 

XPAH  Model  (Swartz  et  al  1995) 

0.28 

1.7 

Metals  ^ 

SEM  -  AVS  (Di  Toro  et  al  1990) 

-52.8;/mol/g 

-106.7// mol/g 

Source:  Swartz  cffl/,  1994 

1  S  2,4'-DDE;  4,4'-DDE;  2,4'-DDD:  4,4'-DDD;  2,4'-DDT;  4,4'-DDT 
^  Aroclor  1254 

^  See  Figure  16-6  for  model  description. 

^  Cu,  Zn,  Pb,  Ni  and  Cd.  SEM  -  AVS  is  not  a  TU  analysis.  Negative  values  indicate  a  low  probability  of  effects  due  to 
metals.  See  Section  16.3. 


sensitive  benthic  invertebrates  (Figure  16-4;  Di  Toro  etal  1990, 1992;  Ankley,  etal  1991, 
1993;  Ankley,  Leonard,  and  Mattson  1994;  Ankley,  Thomas,  etal  1994;  Carlson  et  al 
1991;  Casas  and  Crecelius  1994;  Hare  et  al  1994;  Pesch  et  al  1995).  SEM  -  AVS  thus  pro¬ 
vides  a  geochemical  basis  for  predicting  the  cumulative  effects  of  mixtures  of  metals  in 
anaerobic  sediment.  If  SEM  -  AVS  >  0,  metal  concentrations  in  pore  water  will  increase 
greatly  and  toxicity  may  occur  unless  other  sediment  phases  {e.g.,  OC)  bind  the  metals. 

16.4  Sediment  toxicity  identification  evaluation  methods 

Toxicity  identification  evaluation  is  a  method  to  identify  contaminant  classes  and  possi¬ 
bly  individual  compounds  that  are  responsible  for  the  toxicity  exerted  by  effluent,  water 
or  sediment  samples  that  contain  complex  chemical  mixtures.  Toxicity  identification 
evaluation  was  originally  developed  for  effluents  and  later  applied  to  pore  water  extracted 
from  sediments  (Schubauer-Berigan  and  Ankley  1991).  This  overview  of  sediment  TIE 
procedures  is  based  on  the  recent  review  by  Ankley  and  Schubauer-Berigan  (1995).  Tox¬ 
icity  tests  by  themselves  cannot  identify  the  chemical(s)  that  cause  the  observed  effects. 
Toxicity  identification  evaluation  uses  toxicity-based  fractionation  procedures  to  impli¬ 
cate  specific  contaminants  as  causative  agents;  it  is  conducted  in  three  phases  to  charac¬ 
terize  (Phase  I),  identify  (Phase  II),  and  confirm  (Phase  III)  compounds  responsible  for 
observed  toxicity. 

Phase  I  characterizes  the  physical-chemical  properties  of  toxicants  in  a  sample  through 
manipulations  designed  to  alter  or  render  biologically  unavailable  generic  classes  of 
chemicals  with  similar  properties  (Ankley  and  Schubauer-Berigan  1995).  The  sample 
extractions  and  manipulations  of  Phase  I  are  shown  in  Figure  16-5.  Some  of  the  manipu- 
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♦  Long  Island  Sound  -  Cd  •  Long  Island  Sound  -  Ni 

♦  Mix-Cd  O  Ninigret  PD  -  Ni 

O  Ninigret  PD  -  Cd  •  Foundry  Cove  *  Cd  +  Ni 

Figure  16-4  Percent  mortality  of  amphipods  in  relation  to  the  molar  ratio  of  SEM  to  AVS  of  the 
sediment  (after  Di  Toro  et  al  1992) 


lations  are  the  addition  of  EDTA  to  implicate  cationic  metals;  solid  phase  extraction  of 
nonionic  compounds  in  C13  columns;  pH  manipulations  to  detect  toxicity  due  to  ammo¬ 
nia  and  other  ionic  chemicals;  and  aeration  to  diminish  volatile,  oxidizable,  or  sublatable 
compounds.  Changes  in  toxicity  after  these  treatments  indicate  the  possible  contribution 
of  different  chemical  classes. 

Phase  II  uses  a  combination  of  analytical  and  toxicological  methods  to  identify  specific 
chemicals  that  may  be  responsible  for  observed  toxicity  (Ankley  and  Schubauer-Berigan 
1995).  Phase  II  chemical  analyses  are  guided  by  the  results  of  Phase  1,  e.g.,  the  Phase  II 
analysis  would  focus  on  metals  if  EDTA  chelation  reduced  toxicity.  Toxicity  tests  may  be 
used  in  single  chemical  exposures  to  determine  if  the  measured  concentrations  in  the  TIE 
samples  are  sufficient  to  cause  toxicity. 

Phase  III  is  conducted  to  confirm  that  the  suspect  toxicants  identified  in  Phase  II  are,  in 
fact,  the  actual  toxicants  (Ankley  and  Schubauer-Berigan  1995).  The  procedures  used  in 
Phase  III  include  correlation  between  chemistry  and  toxicity,  evaluation  of  relative  spe¬ 
cies  sensitivity  to  different  chemicals,  behavioral  observations,  spiking  of  samples  or 
controls  with  suspect  toxicants,  mass  balance  of  removed  and  recovered  toxicity  by  filtra¬ 
tion  or  other  manipulations,  and  alteration  of  pH  or  other  water  characteristics  that  af¬ 
fect  the  toxicity  of  specific  chemicals.  Several  of  these  procedures  are  typically  used  to 
provide  the  weight  of  evidence  needed  for  a  Phase  III  confirmation.  Use  of  TIE  methods 
is  most  effective  when  a  single  or  limited  number  of  contaminants  is  responsible  for  ob- 
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Figure  16-5  Conceptual  overview  of  Phase  I  of  a  toxicity  identification  evaluation  (after  Ankley  and 
Schubauer-Berigan  1995) 


served  toxicity.  If  many  chemicals  are  involved,  it  can  be  difficult  to  discriminate  effects 
of  individual  contaminants  . 

The  adaptation  of  effluent  TIE  methods  to  sediments  was  initially  based  on  pore  water 
rather  than  whole  sediments  (Ankley  and  Schubauer-Berigan  1995).  Chemical  concentra¬ 
tions  in  pore  water  are  correlated  with  whole  sediment  toxicity  and  the  toxicity  of  ex¬ 
tracted  pore  water  can  be  tested  directly.  High  speed  centrifugation  without  subsequent 
filtration  appears  to  be  the  best  method  for  collecting  pore  water  (Schults  et  al  1992; 
Ankley  and  Schubauer-Berigan  1994).  Research  is  currently  directed  toward  the  applica¬ 
tion  of  TIE  methods  to  whole  sediments. 

16.5  Models  of  sediment-confaminant  interactions: 

IPAH  model 

The  SPAH  model  is  an  initial  attempt  to  predict  the  sediment  toxicity  of  mixtures  of  PAH 
compounds.  The  model  estimates  the  probability  of  toxicity  using  a  combination  of  EqP, 
QSAR,  TU,  additivity,  and  concentration-response  models  (Figure  16-6).  The  sediment 
concentration  of  OC  and  13  PAH  compounds  are  measured.  Porewater  concentrations 
(PAH|^)  of  the  13  compounds  are  predicted  by  EqP.  The  10-d  LC50  of  each  compound  in 
pore  water  (10-d  LCSOj^)  is  predicted  by  a  QSAR  regression  of  10-d  LCSOj^  (from  spiked 
sediment  tests)  to  (octanol-water  partitioning  coefficient).  Toxic  unit  concentrations 
of  individual  compounds  (TUj)  are  predicted  as  PAHj^/lO-d  LCSO^^.  The  total  number  of 
TUs  of  the  13  compounds  (ZTUj)  is  calculated  by  addition,  assuming  the  additivity  of 
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Figure  16-6  Outline  of  the  XPAH  model  to  predict  the  sediment  toxicity  of  PAH  mixtures  to  marine 
and  estuarine  amphipods  (from  Swartz  etai  1995) 


toxic  effects  of  PAHs.  ZTUj  is  used  to  predict  the  probability  of  toxicity  of  PAH  mixtures 
to  marine  and  estuarine  amphipods  using  a  concentration-response  model  derived  em¬ 
pirically  from  spiked  sediment  toxicity  tests. 

Verification  of  the  SPAH  model  was  based  on  comparison  of  predicted  and  observed  tox¬ 
icity  at  sites  where  PAHs  were  the  major  sediment  contaminants  and  at  sites  where  PAHs 
were  relatively  minor  contaminants  (Swartz  et  al  1995).  There  was  86.6%  correspon¬ 
dence  and  no  significant  difference  between  predicted  and  observed  toxicity  at  the  PAH- 
contaminated  sites.  Predicted  and  observed  toxicity  were  significantly  different  at  sites 
where  PAHs  were  not  the  principal  contaminants.  When  toxicity  was  observed  but  not 
predicted,  the  model  provided  a  useful  indication  of  the  relatively  minor  contribution  of 
PAHs  to  observed  effects. 

The  SPAH  model  has  a  number  of  limitations  and  sources  of  error  that  need  further  re¬ 
search  (Swartz  et  al  1995).  The  model  is  not  applicable  to  chemicals  whose  interaction  is 
antagonistic  or  synergistic.  The  model  does  not  account  for  changes  in  mechanisms  of 
toxicity  photoactivation  of  certain  PAH  compounds;  Ankley,  Collyard,  etal  1994). 
The  model  concerns  acute  (10-d)  amphipod  lethality.  With  the  development  of  chronic 
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amphipod  toxicity  tests,  the  model  could  be  revised  to  address  chronic  effects  and  popu¬ 
lation  dynamics.  Additional  data  on  10-d  porewater  LCSOs  are  needed  to  improve  the  ac¬ 
curacy  of  the  QSAR  regression  used  in  the  ZPAH  model.  The  principal  limitation  with 
respect  to  mixtures  is  the  present  restriction  of  the  model  to  13  PAH  compounds.  In 
theory,  the  model  could  be  expanded  to  include  all  nonionic,  narcotic  compounds  whose 
toxicological  interaction  is  additive. 

16.6  Models  of  sediment-contaminant  interactions: 

QSAR/EqP  model 

Several  unpublished  models  have  been  used  to  predict  acute  and  chronic  effects  of  mix¬ 
tures  of  narcotic  chemicals  in  sediments.  The  most  parsimonious  of  these  models  is 
based  on  species-specific  QSARs  between  aqueous  LCSOs  and  octanol-water  partitioning 
coefficients  (Kq^): 

log  LC50  =  a  log  +  b 

where 

a  =  slope, 
b  =  y-intercept,  and 
LC50  is  expressed  in  mol/ m^. 

Through  multilinear  regression  analysis,  a  common  slope  (a  =  -0,83)  was  established  for 
the  QSARs  of  18  aquatic  species.  At  the  y-intercept  of  the  QSAR  regression,  log  =  0 
(Kq^  =  1)  and  the  equilibrium  concentration  of  a  compound  in  octanol  is  equal  to  the 
equilibrium  concentration  of  a  compound  in  water  {i.e.,  log  LC50  =  b).  Since  octanol  is  a 
good  surrogate  for  lipid,  the  y-intercept  corresponds  to  the  lipid-normalized  LC50  of  a 
narcotic  compound.  The  y-intercept  was  therefore  identified  as  the  toxic  endpoint. 

Sediment  guidelines  could  be  established  to  protect  95%  of  aquatic  species  from  acute 
mortality.  The  lower  5th  percentile  in  the  frequency  distribution  of  toxic  endpoints  was 
15.9  mmol/L.  Thus,  an  acute  narcosis  baseline  toxicity  equation  can  be  derived: 

Log  LC505th  =  (-0.83)  log  -h  log  15.9. 

Equilibrium  partitioning  of  nonionic  organic  compounds  in  sediment  is  a  function  of  the 
sediment  OC  concentration  according  to  the  following  equation  (Adams  et  ah  1985;  Di 
Toro  etal  1991): 

Q  “  ^DC  ^oc 

where 

Cj  =  sediment  concentration  (kg  chem/kg  dry  weight), 

=  particle  OC  weight  fraction  (kg  OC/kg  dry  weight), 

Kqc  =  particle  OC  partition  coefficient,  (L/kg  organic  carbon)  [Kq^  ~  Kq^],  and 
Cd  =  dissolved  concentration  in  pore  water  (kg  chemical/L). 
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The  EqP  equation  can  be  used  to  predict  the  sediment  concentration  at  which  the  freely 
dissolved  porewater  concentration  would  equal  the  acute  narcosis  baseline  concentra¬ 
tion;  />,,  if  Cjj  =  LC505f|^,  the  EqP  equation  becomes 

^s,5th  “  ^oc  ^oc 

where 

Cs  5th  =  sediment  concentration 

when 

Cd  =  LCSOsth. 

The  SQG  for  an  individual  chemical  can  be  expressed  as  the  OC-nofmalized  sediment 
concentration  that  is  not  acutely  toxic  to  95%  of  the  species: 

SQG  =  C,5th/foc  =  K,,LC505th. 

Since  the  toxicological  interaction  of  type  I  narcotic  chemicals  is  additive,  a  sediment 
guideline  for  a  mixture  of  n  such  chemicals  can  be  expressed  as  the  sum  of  the  fraction  of 
the  OC-normalized  sediment  concentration  divided  by  the  SQG  for  each  chemical: 

^Q-^mixture  “ 
i=l 

When  SQGfnixture  ^  the  total  concentration  of  type  I  narcotic  chemicals  in  sediment  is 
expected  to  exceed  the  level  that  will  protect  95%  of  species  from  acute  effects.  A  chronic 
SQG  can  be  calculated  by  assuming  an  acute-chronic  ratio  of  10  for  each  chemical  in  the 
mixture. 

The  initial  conceptual  development  of  the  QSAR/EqP  model  of  sediment  contaminant 
interactions  has  been  completed.  Experimental  analysis  and  field  verification  are  needed 
to  assess  the  efficacy  of  this  model. 

16.7  Sediment  guidefines  for  contaminant  mixtures 

Sediment  toxicity  tests  are  the  principal  tools  currently  used  in  regulatory  programs  to 
assess  the  effects  of  mixtures  of  sediment  contaminants.  The  major  strength  of  toxicity 
tests  is  that  they  reflect  the  cumulative  interaction  and  toxic  effect  of  all  contaminant  and 
other  stresses  imposed  by  sediment  samples  on  test  species.  Their  major  weakness  is  that 
they  cannot  be  used  alone  to  identify  the  factors  responsible  for  the  toxic  response. 

Most  numerical  SQGs,  including  the  proposed  USEPA  (1993a,  1993b,  1993c)  sediment 
quality  criteria,  have  been  developed  for  single  compounds.  However,  a  few  guidelines 
have  been  established  for  mixtures  of  compounds  within  chemical  classes,  especially 
PAHs  (Table  16-2).  The  sediment  mixture  guidelines  were  developed  using  effects  range 
(ER)  (Long  et  al  1995)  and  apparent  effects  threshold  (AET)  (Washington  State  Depart¬ 
ment  of  Ecology  1990;  PTI  1991a,  1991b)  methods  applied  to  the  sum  of  the  concentra¬ 
tions  of  compounds  in  a  particular  chemical  class.  For  example,  the  AET  guideline  for  low 
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Sediment  concentration 


Mixture 

(wgAg,  dry  weight) 

mg/kg  OC 

ERL^ 

ERM  2 

AET^ 

Microtox 

AET 

Amphipod 

WASQS^ 

WACSL® 

LPAH® 

552 

3,160 

5,200 

24,000 

370 

780 

HPAH  7 

1,700 

9,600 

12,000 

69,000 

960 

5,300 

TPAH® 

4,022 

44,792 

TDDT^ 

1.58 

46.1 

TpcbIO 

22.7 

180 

130 

3,100 

^  Effects  range  -  Low  (Long  et  al.  1995) 

^  Effects  range -Median  (Long  cffl/.  1995) 

^  Apparent  effects  threshold  for  amphipod  and  Microtox  toxicity  tests  (PTI  1991a) 

^  State  of  Washington  Sediment  Quality  Standards  (PTI  1991b,  Washington  State  Department  ofEcology  1990) 

^  State  of  Washington  Cleanup  Screening  Level  (PTI  1991b,  Washington  State  Department  ofEcology  1990) 

®  Low  molecular  weight  PAHs 
^  High  molecular  weight  PAHs 
®  Total  PAHs 

^  Total  DDT  and  its  metabolites 
10  Total  PCBs 


molecular  weight  PAHs  (LPAH)  was  based  on  the  sum  of  the  concentrations  of  naphtha¬ 
lene,  acenaphthylene,  acenaphthene,  fluorene,  phenanthrene,  anthracene,  and  2-meth- 
ylnaphthalene  in  individual  sediment  samples. 

The  AET  and  ER  mixture  guidelines  average  about  75%  of  the  sum  of  the  individual 
guidelines  (Washington  State  Department  of  Ecology  1990;  PTI  1991a,  1991b;  Lon^etal 
1995).  In  the  case  of  the  Cleanup  Screening  Level  of  the  State  of  Washington  (WACSL, 
Table  16-2),  the  mixture  guideline  for  high  molecular  weight  PAHs  (HPAH  =  5,300  mg/ 
kg  OC),  actually  exceeds  the  sum  of  the  guidelines  for  the  nine  individual  HPAH  (S  = 
4,189  mgAg  OC;  PTI  1991a,  1991b).  Similarly,  the  ERL  and  ER-median  (ERM)  guidelines 
for  total  PAH  exceed  the  sum  of  the  guidelines  for  LPAH  and  HPAH  (Long  et  al.  1995). 
This  suggests  that  sediment  contaminants  act  independently  or  antagonistically  to  one 
another,  a  conclusion  that  is  inconsistent  with  evidence  supporting  the  additivity  of  toxic 
effects  of  PAHs,  metals,  and  other  sediment-associated  chemicals  in  sediment  (Swartz  et 
al  1988, 1995;  Di  Toro  et  al  1992;  Ankley,  Thomas,  et  al  1994).  If  cumulative  effects  are 
additive,  the  concentration  of  individual  chemicals  in  a  sediment  that  exceeds  the  mix¬ 
ture  guideline  should  be  substantially  less  than  the  guideline  concentration  for  the  indi¬ 
vidual  chemical  by  itself. 
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There  are  two  explanations  for  this  apparent  paradox.  First,  the  PAH  measurements  from 
which  the  mixture  guidelines  were  derived  may  include  concentrations  of  compounds  for 
which  individual  guidelines  were  not  developed.  Second,  because  much  (in  some  cases, 
all)  of  the  data  used  to  derive  PAH  guidelines  were  collected  from  field  samples,  the  AET 
and  ER  guidelines  for  individual  PAHs  may  reflect  the  effects  of  covarying  chemicals,  in¬ 
cluding  other  PAHs.  If  so,  all  of  the  guidelines  for  individual  PAHs  do  not  represent  the 
effects  of  single  chemicals  by  themselves.  Rather,  an  individual  PAH  guideline  represents 
the  concentration  of  a  single  chemical  in  an  unknown  mixture  of  covarying  chemicals 
whose  cumulative  toxicity  is  sufficient  to  exert  adverse  effects. 

16.8  Research  issues 

We  have  identified  a  number  of  major  research  needs  concerning  the  development  and 
verification  of  methods  to  predict  effects  of  mixtures  of  contaminated  sediments.  They 
including  the  following: 

•  Causality/ correlation  discrimination  among  covarying  factors :  Effects  attrib¬ 
uted  to  one  chemical  may  actually  be  caused  by  another,  covarying  chemical. 
This  issue  must  be  resolved  before  reliable  guidelines  for  single  chemicals  can  be 
developed  from  field-collected  sediment  data. 

•  QSAR  models  for  sediment  toxicity:  Accurate  QSAR  regressions  are  integral 
components  of  several  models  to  predict  toxicity  of  sediment  contaminant 
mixtures.  The  QSARs  should  be  developed  directly  from  porewater  LCSOs. 

•  Additivity  hypothesis:  The  assumption  of  additivity  of  toxic  effects  of  narcotic 
and  other  chemicals  in  sediment  must  be  verified.  The  interaction  of  nonaddi¬ 
tive  chemicals  is  poorly  understood. 

•  Concentration-response  model:  Quantification  of  concentration-response 
relations  is  necessary  to  predict  toxic  effects. 

•  Toxicity  identification  evaluation  methods  for  whole  sediments:  TIE  methods 
need  to  be  adapted  to  whole  (solid  phase)  sediments  to  improve  the  direct 
relevance  of  TIE  to  standard  sediment  toxicity  tests. 

•  Chronic  effects  of  sediment  contaminant  mixtures:  Initial  research  on  the 
toxicity  of  mixtures  of  sediment  contaminants  has  largely  been  restricted  to 
acute  effects.  Chronic  effects  on  growth,  reproduction,  and  population  dynam¬ 
ics  need  to  be  investigated. 

•  Interactions  between  chemical  contaminants  and  other  stresses:  Noncontami¬ 
nant  stress  factors  like  grain  size  and  salinity  can  interact  with  chemical 
contaminants  and  affect  biological  responses.  Effects  of  noncontaminant 
stresses  should  be  examined  carefully,  especially  during  chronic  exposures. 

•  Field  verification  of  mixture  model:  Models  of  the  interactions  of  contaminant 
mixtures  have  to  be  verified  in  relation  to  in  situ  biological  and  toxicological 
effects  of  contaminated  sediments. 
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17.1  Introduction 

In  this  chapter,  a  range  of  issues  related  to  the  uncertainty  associated  with  SERAs  is  de¬ 
scribed,  including  an  evaluation  of  1)  uncertainty  associated  with  the  overall  SERA  frame¬ 
work,  2)  the  effects  of  false  positive  and  false  negative  errors  associated  with  sediment 
toxicity  tests,  3)  spatial  and  temporal  distributions  of  sediment  contamination,  4)  sam¬ 
pling  errors,  and  5)  uncertainties  associated  with  transport  and  fate  models.  Chapter  18 
describes  the  uncertainty  associated  with  specific  measurement  endpoints  commonly 
used  in  SERAs  and  discusses  approaches  for  addressing  these  sources  of  uncertainty. 

The  goal  of  any  uncertainty  analysis  is  to  describe  and  interpret  knowledge  limitations 
that  may  be  present  in  the  measurement  endpoints  used  to  conduct  a  SERA  analysis,  for 
the  purpose  of  incorporating  estimates  of  uncertainty  into  management  decisions.  A 
number  of  viewpoints  were  discussed  at  the  Workshop  for  defining  uncertainty,  two  of 
which  are  described  below. 

In  the  first  viewpoint,  uncertainty  is  considered  to  be  composed  of  two  components:  1) 
measures  of  bias  (i.e.,  consistent  deviation  of  measured  values  from  the  true  value)  and  2) 
measures  of  precision  {i.e.,  measure  of  agreement  among  replicable  analyses  of  a  sample). 
Accuracy  is  the  combination  of  bias  and  precision  for  a  procedure  which  reflects  the 
closeness  of  a  measured  value  to  a  true  value.  Figure  17-1  presents  a  visual  interpretation 
of  these  components  of  uncertainty  (Jessen  1978).  Note  that  bias  and  precision  are  inde¬ 
pendent.  For  example,  a  method  could  have  low  bias  and  low  precision  (Figure  17-lb),  or 
high  bias  and  low  precision  (Figure  17-lc).  Either  combination  leads  to  a  decline  in  the 
overall  confidence  in  the  measurement. 

Strictly  defined  estimates  of  accuracy  are  limited  to  formal  experiments  such  as  interlabo¬ 
ratory  testing  of  a  blind  (but  known)  chemical  concentration.  In  contrast,  many  sediment 
surveys  are  conducted  without  the  benefit  of  knowing  the  “true”  value  {i.e.,  accuracy  of 
sediment  toxicity  tests  with  field- collected  sediments).  In  these  cases,  estimates  of  field 
precision  are  limited  and  a  weight-of-evidence  is  used  as  a  surrogate  to  estimating  bias. 
Strictly  defined,  precision  is  the  observed  variance  of  repeated  measurements  conducted 
under  the  same  conditions  {e.g.,  the  variance  associated  with  repeated  ponar  grabs  at  the 
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(a)  High  Bias  +  Low  Precision  =  Low  Accuracy; 

(b)  Low  Bias  +  Low  Precision  =  Low  Accuracy; 

(c)  High  Bias  +  High  Precision  =  Low  Accuracy; 

(d)  Low  Bias  +  High  Precision  =  High  Accuracy, 


Figure  17-1  Data  precision,  accuracy,  and  bias  represented  by  shot  patterns  in  targets  (after  Jessen 


same  location).  In  practice  however,  biological  and  chemical  properties  are  very  dynamic, 
making  rigorous  estimates  of  bias  and  precision  difficult  to  obtain  (see  Section  17.2.4.3). 

In  the  second  viewpoint,  uncertainty  can  be  evaluated  in  the  context  of  expert  judgment 
and  opinion  in  the  analysis  of  uncertainty.  While  determination  of  accuracy  and  preci¬ 
sion  of  management  tools  provides  direct  information  for  evaluating  uncertainty,  many 
methods  are  not  amenable  to  this  type  of  assessment.  For  this  reason,  less  quantitative 
methods  are  often  used  to  evaluate  uncertainty,  such  as  expert  judgment  (see  Chapter 
18).  Although  we  may  not  have  definitive  numerical  measurements  on  the  ecological  rel- 
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evance  of  a  specific  measurement  endpoint,  a  well-designed  expert  opinion  survey  can  be 
used  to  generate  knowledge  relevant  to  the  issue.  Similar  to  numerical  analysis,  the  larger 
the  opinion  survey  the  greater  the  information  we  have  to  assess  uncertainty.  In  practice, 
expert  opinion  may  be  more  available  than  well-conducted  numerical  analyses  of  uncer¬ 
tainty  and  can  be  a  useful  source  of  information.  A  large  statistical  literature  is  available 
on  methods  for  generating  expert  opinion  in  a  formal  analysis  of  uncertainty  {i.e.,  Bayes 
theory;  Chapter  13).  Bayes  theory  can  be  used  to  combine  both  subjective  and  quantita¬ 
tive  sources  of  information  in  a  decision-making  process.  In  the  following  sections,  we 
use  both  of  the  viewpoints  described  above  to  discuss  the  sources  of  uncertainty  and  the 
implication  of  uncertainty  in  SERAs.  We  encourage  scientists  and  policy-makers  to  con¬ 
sider  uncertainty  in  risk-based  decisions.  We  hope  that  by  addressing  the  uncertainty 
issues,  decision-makers  will  have  valuable  information  available  for  weighing  the  various 
options  available  for  risk  reduction. 

17.2  Uncertainty  in  the  risk  assessment  framework 

Guidelines  have  been  developed  for  ERAs  to  promote  consistency  in  analysis  (Chapter  1). 
These  guidelines  also  allow  for  the  establishment  of  quality  standards  and  consistent 
terminology  for  assessments.  Consistency  in  the  use  of  guidelines  can  help  inform  all 
stakeholders  as  to  the  relative  degree  of  confidence  and  scientific  knowledge  under  which 
the  decision  was  made  (Russell  and  Gruber  1987).  Several  guidelines  are  in  use  with  vary¬ 
ing  degrees  of  consistency.  Many  of  these  methods  are  based  on  similar  procedures  and 
principles;  therefore,  the  USEPA  Framework  for  Ecological  Risk  Assessment  (USEPA 
1992a)  was  used  in  this  chapter  as  a  guideline  (Chapter  1;  Figure  1-1).  Each  of  these  areas 
is  discussed  in  more  detail  below. 

17.2.1  Problem  formulation 

Problem  formulation  is  the  planning  or  experimental  design  stage  of  the  overall  risk  as¬ 
sessment  process.  Uncertainty  in  the  formal  statistical  sense  is  of  lesser  importance  at  this 
stage  of  the  process;  however,  there  is  qualitative  uncertainty  in  the  appropriate  choice  of 
assessment  endpoints  (objectives  or  purposes  of  the  risk  assessment)  and  measurement 
endpoints  (indicators  or  tools  used  to  evaluate  risk  or  effects).  The  best  way  to  reduce 
these  initial  uncertainties  in  the  problem  formulation  is  to  involve  all  interested  parties 
through  stakeholder  input.  This  involves  asking  all  interested  parties  (including  the  risk 
managers,  the  scientific  community,  and  the  public)  to  define  the  problem  in  the  form  of 
a  concise  narrative.  Once  the  problem  has  been  identified,  appropriate  assessment  and 
measurement  endpoints  can  be  selected.  This  chapter  focuses  primarily  on  uncertainty 
in  relation  to  evaluating  effects  of  chemical  stressors.  However,  nonchemical  stressors 
could  be  a  dominant  process  influencing  the  system  {e.g.,  habitat  disturbance)  or  other 
chemical  or  nonchemical  stressors  potentially  will  interact  to  produce  perturbations  {e.g., 
ammonia  and  dissolved  oxygen  in  the  lower  Fox  River,  Ankley  et  al  1992;  temperature 
and  metals  in  the  Clark  Fork  River,  Kemble  et  al.  1994). 
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In  the  case  of  retrospective  risk  assessments  (in  some  instances,  termed  impact  assess¬ 
ments),  identification  of  the  stressors  is  a  potential  source  of  uncertainty.  Identification  of 
stressors  should  be  part  of  the  problem  formulation  stage  and  typically  consists  of  1)  a 
survey  of  the  natural  and  anthropogenic  stressors  which  may  be  associated  with  the  test 
area,  2)  an  assessment  of  the  available  data  relative  to  quality  control  and  quality  assur¬ 
ance,  and  3)  hypothesizing  potential  stressors.  In  some  cases,  it  may  be  necessary  to 
make  use  of  physical  and  chemical  separation  techniques  (e.g.,  toxicity-based  fraction¬ 
ation  methods)  to  identify  specific  classes  of  contaminant  stressors.  For  example,  extrac¬ 
tion  of  pore  water  from  sediment  may  allow  partial  identification  of  potential  chemical 
stressors  through  TIE  methods  which  use  physical-chemical  manipulations  to  affect  the 
toxicity  of  specific  contaminants  of  concern  (Chapter  18).  Uncertainties  also  exist  in  the 
identification  of  both  assessment  and  measurement  endpoints,  an  area  where  good  pro¬ 
fessional  judgment  is  valuable  (Chapter  7;  section  7.4). 

17.2.2  Exposure  characterization 

The  analysis  phase  of  the  risk  assessment  framework  consists  of  two  activities:  character¬ 
ization  of  exposure  and  characterization  of  effects  (Chapter  1;  Figure  1-1).  The  purpose  of 
characterization  of  exposure  is  to  predict  or  measure  the  spatial  and  temporal  distribu¬ 
tion  of  a  stressor  and  its  co-occurrence  or  contact  with  the  ecological  components  of  con¬ 
cern  (USEPA  1992a).  These  uncertainties  may  influence  the  planning,  execution,  or 
interpretation  stage  of  the  exposure  characterization.  Primary  sources  of  uncertainty  in 
characterizing  exposure  include  1)  laboratory  imprecision,  2)  matrix  interference  errors, 
3)  sample  location  biases,  4)  sample  collection  and  handling  errors,  5)  phase  distribution 
of  stressor,  6)  contamination  of  the  sample  with  other  stressors,  7)  spatial  and  temporal 
heterogeneity  of  the  stressor,  8)  references  for  comparison  of  stressor  levels,  9)  substrate 
type  and  interactions  with  stressor,  10)  life  history  of  the  organism,  11)  nonequilibrium 
of  chemical  stressor  between  sediment  and  water,  12)  response  model  prediction  error, 
13)  data  transformation  and  normalization  errors,  14)  exposure  pathway  analysis  errors, 
15)  sediment  transport  modeling  assumptions,  and  16)  fate  analyses  errors. 

17.2.3  Effects  characterization 

The  purpose  of  characterization  of  effects  is  to  identify  and  quantify  the  adverse  effects 
resulting  from  exposure  to  a  stressor  (USEPA  1989).  The  16  areas  of  uncertainties  listed 
above  for  exposure  characterization  may  also  influence  the  planning,  execution,  or  inter¬ 
pretation  stage  of  the  effects  characterization.  Additional  sources  of  uncertainty  in  effects 
characterization  may  include  1)  effects  of  noncontaminant  stressors  in  toxicity  tests,  2) 
reference  comparisons  to  toxicity  or  receptor  distributions,  3)  laboratory-to-field  ex¬ 
trapolations,  4)  interpretation  and  definition  of  natural  variability,  5)  differences  in  re¬ 
ceptor  species  sensitivity,  6)  differences  in  physical  alterations  of  the  sediment,  and  7) 
differences  in  stressor-response  relationships. 

17.2.4  Risk  characterization 

Risk  characterization  may  either  be  prospective  {e.g,  product  hazard  assessment  as  de¬ 
scribed  in  Chapters  3  and  4)  or  retrospective  (e.g,  impact  hazard  assessment  as  described 
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in  Chapters  6  and  7).  Risk  characterization  at  the  organismal  level  has  traditionally  been 
done  by  comparison  of  the  concentration  of  the  stressors  found  in  the  environment  to 
the  responses  reported  for  those  stressors  in  the  laboratory,  field,  or  by  use  of  the  litera¬ 
ture.  This  risk  characterization  can  be  performed  as  described  in  the  following  sections. 

1 7.2.4. 1  Use  of  quotients  for  risk  assessment 

Risk  quotients  are  simple  ratios  of  exposure  and  effects.  For  example. 

Risk  a  =  Exposure  concentration 
Effect  concentration 

Traditionally,  the  quotient  method  has  been  used  to  compare  the  effect  concentrations 
for  the  most  sensitive  species  of  concern  to  the  average,  median,  mean,  or  highest  expo¬ 
sure  concentration.  In  addition,  these  exposure  concentrations  may  be  compared  to  an 
effect  concentration  derived  from  toxicity  tests.  This  assessment  can  be  made  more  con¬ 
servative  by  the  use  of  safety  (application)  factors,  such  as  division  of  the  effect  level  by  a 
number  such  as  20  (Canadian  Water  Quality  Guidelines  [CWQG]  1987).  Use  of  safety 
factors  allows  for  unquantified  uncertainty  in  the  effect  and  the  exposure  estimations  or 
measurements.  Because  this  uncertainty  is  unknown  and  unquantifiable,  substantial 
errors  are  possible,  both  in  underestimating  or  overestimating  the  risk. 

In  the  absence  of  sufficient  information  from  toxicity  tests,  these  risk  assessments  may  be 
underprotective.  Conversely,  where  a  wide  range  of  toxicity  data  is  available,  the  variation 
in  receptor  response  may  be  well  defined  and  further  use  of  safety  factors  may  be  over- 
protective.  Use  of  the  quotient  approach  is  acceptable  for  early  tiers  of  the  risk  assess¬ 
ment,  but  the  approach  fails  to  consider  the  range  of  variation  which  may  exist  in  terms 
of  exposures  and  susceptibility  (/>.,  Chapter  5  dealing  with  dredging  assessments).  Re¬ 
cently  a  method  was  proposed  for  using  quotients  in  Tier  I  risk  assessments,  which  in¬ 
cluded  the  incorporation  of  uncertainty  in  both  the  numerator  and  denominator  of  the 
quotient  equation  (Parkhurst  etaL  1995). 

77.2.4.2  Probabilistic  risk  assessment 

A  second  approach  for  evaluating  risk  is  to  express  the  results  of  a  refined  risk  character¬ 
ization  analysis  as  a  distribution  of  toxicity  values  rather  than  a  single  point  estimate  (i.e., 
Chapters  3  and  4  dealing  with  product  assessment).  For  example,  this  approach  has  been 
proposed  or  is  now  being  used  by  the  Dutch  government  (Health  Council  of  the  Nether¬ 
lands  1993),  Cardwell  et  al  (1993),  Baker  et  al  (1994),  Solomon  et  al  (1996),  and  Klaine 
et  al  (1996).  A  major  advantage  of  the  probability  approach  is  that  it  uses  all  relevant 
single-species  toxicity  data  and,  when  combined  with  exposure  distributions,  allows  for 
quantitative  estimation  of  the  risks  to  receptors.  However,  the  approach  is  only  valid  if 
endpoints  used  in  the  assessment  are  similar.  For  example,  survival  data  would  not  be 
expected  to  be  protective  of  reproductive  effects.  The  degree  of  overlap  of  the  exposure 
curve  (drawn  as  a  log-Pearson  Type  III  distribution;  McBean  and  Rovers  1992)  with  the 
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effects  curve  can  be  used  to  estimate  the  probability  that  a  certain  percentage  of  receptors 
may  be  adversely  affected  for  a  percentage  of  occasions  {i.e.,  Figure  17-2).  A  similar  ap¬ 
proach  has  been  used  in  the  derivation  of  USEPA  Water  Quality  Criteria  (Stephan  et  al 
1985).  With  the  use  of  overlapping  distributions,  there  is  an  implicit  assumption  that 
protecting  a  certain  percentage  of  species  for  a  certain  proportion  of  occasions  will  also 
preserve  ecosystem  structure  and  function. 

Although  this  approach  to  risk  characterization  takes  into  account  much  of  the  variabil¬ 
ity  with  regard  to  the  range  of  susceptibility  in  receptor  species,  it  still  embodies  several 
uncertainties  and  limitations.  For  example,  the  choice  of  protection  level  {e.g.,  90%  of 
species)  may  not  be  socially  acceptable.  Some  may  view  90%  as  being  overprotective, 
whereas  others  may  find  this  level  of  risk  unacceptable,  especially  if  the  10%  of  potentially 
affected  species  includes  endangered  species  or  other  organisms  of  ecological,  commer¬ 
cial,  or  recreational  importance  (see  Chapter  11).  Additionally,  risks  of  persistent, 
bioaccumulative  chemicals  to  species  at  the  top  of  the  food  chain  may  not  be  sufficiently 
addressed  by  this  approach  (Baker  1994;  Section  18.5).  In  the  situation  where  there 
is  a  desire  to  protect  more  sensitive  receptors,  these  species  could  be  identified  and  ap¬ 
propriate  mitigation  measures  taken. 

A  further  issue  requiring  consideration  in  probabilistic  risk  assessments  is  the  number  of 
data  points  required  to  define  the  distribution  of  receptor  species  for  either  acute  or 
chronic  effects.  Additional  test  species  and  endpoints  beyond  those  now  applied  for 
SERAs  may  be  needed  (Burton  and  Ingersoll  1994).  In  addition,  there  is  a  need  for  meth¬ 
ods  such  as  those  proposed  by  Parkhurst  et  al  (1995)  for  calculating  the  degree  of  risk 
associated  with  exposures  to  multiple  chemicals. 

77.2.4.3  Retrospective  risk  assessment 

Risk  assessments  based  on  measurement  of  current  conditions  are  considered  retrospec¬ 
tive  and  typically  do  not  forecast  the  expected  change  in  risk  due  to  remedial  or  mitiga¬ 
tory  options  or  changes  in  the  ecosystem  in  the  future.  Retrospective  risk  assessments 
rely  on  a  number  of  techniques  discussed  in  more  detail  in  Chapter  5  (dredging  assess¬ 
ment)  and  Chapters  6  and  7  (site  cleanup  assessment).  These  assessments  may  include 
measurement  endpoints  such  as  sediment  toxicity  tests,  assessments  of  structural  or 
functional  changes  in  the  benthic  communities,  cellular  and  molecular  effects  in  the  re¬ 
ceptor  species,  or  the  presence  of  tissue  residues  of  contaminants  of  concern  (Chapter 
18). 

The  use  of  multiple  lines  of  evidence  (weight  of  evidence)  is  particularly  important  in 
retrospective  risk  assessment  (USEPA  1992b)  and  may  also  be  useful  in  prospective 
analysis.  For  example,  the  probability  that  an  effect  on  benthic  community  structure  is 
the  result  of  exposure  to  a  chemical  stressor  is  made  more  certain  if  the  concentration  of 
the  stressor  in  the  area  is  high  enough  to  have  caused  the  observed  effect  and  also  results 
in  overt  toxicity  in  laboratory  toxicity  testing. 
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Figure  17-2  Graphical  representation  of  use  of  probabilistic  risk  assessment  with  sediments.  Cumula¬ 
tive  frequency  distributions  of  concentrations  of  stressors  in  sediments  are  compared  with 
distributions  of  sensitive  benthic  organisms.  Arrows  show  probabilities  of  not  exceeding 
10th  percentile  sensitivity  concentrations  for  acute  and  chronic  endpoints  at  3  sites 
(adapted  from  Solomon  etal.  1996). 

17.2.5  Risk  management 

The  outcome  of  all  risk  management  actions  should  be  either  the  acceptance  or  the  re¬ 
duction  of  the  risk.  Risk  reduction  involves  many  potential  actions  which  range  from  the 
technical  through  the  socioeconomic  to  the  political.  In  undertaking  risk  management, 
it  is  necessary  to  take  these  actions: 

•  Decide  which  risks  must  be  managed  and  in  what  priority.  This  requires  that 
some  method  for  measuring  and  comparing  risks  must  be  available  (e.g., 

Chapter  11  on  ecological  relevance). 

•  Maximize  the  reduction  of  risk  for  the  available  resources.  This  implies  that  a 
system  must  be  in  place  for  assessing  the  degree  of  risk  reduction  and  for 
measuring  its  cost. 

17.2.5.7  Uncertainty  in  prioritizing  risks 

In  general,  the  first  step  in  ranking  risks  for  management  involves  evaluation  of  the  harm¬ 
ful  effects  of  the  action  associated  with  the  production  or  release  of  the  stressor.  In  the 
case  of  human  health,  this  response  may  be  expressed  as  a  numerical  risk.  Even  though 
the  risk  assessment  process  may  have  limitations,  estimates  of  relative  risk  may  be  com¬ 
parable  if  similar  processes  are  used  to  derive  the  risks.  An  additional  difficulty  is  pre¬ 
sented  by  unquantifiable  risks.  This  applies  particularly  to  environmental  risk  which  may 
have  measurement  endpoints  of  an  aesthetic  nature,  such  as  reduced  days  of  recreational 
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fishing  or  reduced  view.  Endpoints  of  this  type  cannot  be  quantified  in  the  same  terms  as, 
for  example,  fish  mortality. 

Harwell  et  al  (1992)  proposed  a  method  for  evaluating  and  prioritizing  risk  to  human 
health  and  the  environment.  The  system  is  based  on  recognition  of  the  issues  raised  ear¬ 
lier  in  this  document  including  the  following: 

•  Acknowledging  that  ecosystems  are  diverse 

•  Knowing  that  ecosystems  respond  to  stress  differently  and  that  this  response  is 
governed  by  the  type  of  ecosystem  and  the  type  of  stressor 

•  Recognizing  that  a  wide  range  of  temporal,  organizational,  and  spatial  scales  are 
involved 

•  Knowing  that  the  measurement  endpoints  are  relevant  to  the  selected  assess¬ 
ment  endpoints 

•  Knowing  the  normal  baseline  behavior  of  the  ecosystem 

•  Having  good  extrapolation  techniques  from  laboratory  and  field  measurement 
endpoints  to  the  selected  assessment  endpoints 

•  Considering  uncertainty  in  all  of  these  issues 

The  risks  to  be  prioritized  are  then  separated  into  a  series  of  components,  which  are 
ranked  as  follows: 

•  The  potential  magnitude  of  the  risk.  Magnitude  is  ranked  on  an  ordinal  scale  of 
5,  ranging  from  low  to  high  as  follows:  Low  <  Medium  <  High  <  Very  High  < 
Extremely  High. 

•  The  geographic  extent  of  the  risk.  Extent  is  ranked  on  an  ordinal  scale  of  3, 
ranging  from  low  to  high  as  follows:  Local  <  Regional  <  Biosphere. 

•  The  recovery  time.  Recovery  time  is  ranked  on  an  ordinal  scale  of  3,  ranging 
from  low  to  high  as  follows:  Short  (years)  <  Medium  (decades)  <  Long  (centu¬ 
ries). 

These  scores  can  then  be  combined  and  used  for  ranking  purposes.  However,  because 
these  ranks  are  based  on  expert  assessment,  they  are  subject  to  uncertainty  and  bias.  As 
suggested  above  for  problem  formulation,  uncertainty  of  qualitative  assessments  may  be 
reduced  by  involving  expert  opinion  polls  and  the  stakeholders  in  the  process. 

17,2.5,2  Uncertainty  in  assessing  risk  reduction  strategies 

Many  options  for  risk  reduction  may  be  available  to  the  risk  manager;  however,  there  are 
generally  two  types  of  tools:  technological  and  regulatory. 

Technological  tools  for  risk  mitigation  include  a  wide  range  of  procedures,  many  of  which 
are  specific  to  the  situation.  In  the  case  of  sediments  which  are  contaminated  by  effluent 
discharges,  further  treatment  of  the  effluent  before  release  is  commonly  applied  in  indus¬ 
trial  settings.  In  the  case  of  in  situ  contamination  of  sediments,  many  cleanup  and  dis¬ 
posal  options  are  available  (Francinques  etal  1985;  International  Joint  Commission  [IJC] 
1988)  once  sediment  has  been  identified  as  containing  chemicals  at  concentrations  pos- 
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ing  a  problem.  The  sediment  can  either  be  removed,  stabilized,  capped,  treated  in  situ,  or 
“no  action”  may  be  taken  (Lynam  etal  1987;  Grigalunas  and  Opaluch  1989).  The  reme¬ 
diation  procedure  or  combination  of  procedures  chosen  is  specific  to  the  study  area  and 
depends  on  ecological,  chemical,  physical,  engineering,  economic,  human  health,  and 
political  considerations.  Furthermore,  source  control  and  continued  monitoring  must  be 
included  with  any  remediation  effort  to  avoid  creation  of  new  problem  areas. 

The  regulatory  tools  which  may  be  used  for  risk  mitigation,  in  increasing  order  of  effec¬ 
tiveness,  are  as  follows: 

•  Provide  better  information  and  communication  to  prevent  misuse  of  stressors 
that  may  contaminate  sediments. 

•  Better  control  discharges  and  releases  of  stressors  to  levels  which  are  judged  to 
present  a  tolerable  risk  to  the  benthic  community. 

•  Restrict  the  use  and  application  of  the  stressor. 

•  Impose  a  total  manufacturing  ban  on  the  stressor. 

Uncertainties  exist  which  affect  the  selection  of  technological  or  regulatory  tools  that 
should  be  used  to  mitigate  risks.  Uncertainty  of  knowledge  (which  technological  options 
are  available)  is  best  addressed  through  expert  opinion  surveys  and  stakeholder  consul¬ 
tations.  Uncertainties  in  the  degree  of  risk  reduction  are  best  assessed  by  reiterating  the 
risk  assessment  procedure  for  all  the  appropriate  exposure  reduction  strategies  and  then 
ranking  these  in  terms  of  both  the  reductions  in  risk  and  the  uncertainty  in  achieving 
these  reductions.  This  matrix  will  allow  informed  choices  to  be  made  and  the  “trading 
off”  of  costs  with  risk  reductions  and  the  uncertainties  of  achieving  these  reductions. 

77.2.5.3  Uncertainty  in  assessing  societal  values 

If  ecosystems  are  viewed  as  providing  services  to  society,  these  services  can  be  assessed  to 
have  an  economic  value.  Ail  components  of  the  ecosystem  can  be  assigned  an  economic 
value;  however,  this  view  has  been  criticized,  particularly  in  the  assigning  of  value  to  con¬ 
cepts  such  as  species  richness  and  diversity.  In  assigning  economic  value  to  ecological 
services,  the  implication  is  that  these  services,  including  physical  capital  (equipment  and 
technology)  or  human  capital  (knowledge  and  skills),  are  interchangeable  and  can  be 
traded  in  the  same  way  as  these  commodities,  for  example,  writing  off  the  loss  of  a  spe¬ 
cies  for  an  increase  in  copper  production.  In  addition,  assignment  of  economic  value  is 
often  restricted  to  only  a  few  components  of  the  system  at  risk  and  may  ignore  temporal 
and  spatial  interconnectedness  of  organisms,  populations,  and  ecosystems.  Uncertainty 
with  respect  to  economic  issues  which  should  be  considered  is  as  follows  (Harwell  et  al 
1992): 

•  Sustainability:  Irreversible  resource  damage  will  undermine  the  sustainability  of 
ecosystems  (and  by  extension,  human  society).  Thus,  irreversible  damage  to  an 
ecosystem  should  not  be  economically  discounted  over  a  period  of  years  (as  in 
the  amortization  of  equipment  and  capital  resources),  as  this  devalues  the 
importance  of  long-term  environmental  problems.  Thus,  regulatory  agencies  or 
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politicians  may  relegate  a  problem  to  a  lower  level  of  importance  because  the 
effect  will  only  be  felt  at  some  time  in  the  future  (f.g.,  global  warming). 

•  Willingness  to  pay:  This  assumes  that  market  prices  can  be  used  to  assess  the 
tastes  and  preferences  of  society.  The  problem  with  this  approach  is  that 
individuals  and  society  may  enjoy  the  services  provided  by  ecosystems  (i.e,, 
clean  air,  water,  weather  control,  food  chain  maintenance,  genetic  diversity) 
without  understanding  them  or  even  having  knowledge  of  their  existence.  Thus, 
their  willingness  to  pay  for  them  or  assign  values  to  such  services  may  be 
incorrect  and  inappropriate  relative  to  their  real  ecological  value. 

•  Multipliers:  Economic  analysis  of  benefits  always  includes  multipliers  (e.g. , 
developing  a  mine  results  in  jobs  for  construction  workers  and  a  demand  for 
materials).  Multipliers  should  also  be  used  in  the  risk  side  of  the  risk:benefit 
equation.  For  example,  the  loss  of  a  benthic  community  may  result  in  losses  to 
fisheries,  transportation,  fishing  equipment  manufacturers,  the  accommoda¬ 
tion  industry,  and  marina  operations. 

17.3  Special  issues  of  uncertainty  in  sediment  ecological  risk 
assessment 

17.3.1  Decision-making  with  sediment  toxicity  measurement  endpoints: 
exploring  effects  of  false  positive  and  false  negative  errors 

Sediment  risk  assessments  can  be  used  in  a  variety  of  applications,  including  the  assess¬ 
ment  of  relative  risk  between  an  impacted  site  and  a  reference  site  or  the  reduction  in  risk 
associated  with  a  remediation  action.  A  variety  of  chemical  and  biological  measurement 
endpoints  can  be  used  in  the  assessment,  including  laboratory  sediment  toxicity  tests 
and  SQGs  (see  Sections  18.2, 18.3,  and  18.7).  For  example,  sediment  toxicity  tests  are 
now  used  to  evaluate  the  relative  difference  in  organism  survival  or  growth  between  sedi¬ 
ments  from  reference  areas  and  dredged  material  (Chapter  5  on  dredging  assessment; 
USEPA  and  USAGE  1991, 1994).  Test  endpoints  such  as  mortality  or  growth  of  organisms 
exposed  in  the  laboratory  to  field-collected  sediments  are  assumed  to  reflect  the  response 
of  organisms  in  the  field  exposed  to  dredged  material. 

A  key  issue  in  the  use  of  sediment  endpoints  within  a  regulatory  or  programmatic  envi¬ 
ronment  is  the  level  of  confidence  in  the  results  of  this  assessment.  Using  the  above  ex¬ 
ample,  in  dredging  there  is  uncertainty  in  determining  1)  the  probability  of  stating  that 
the  reference  area  and  the  dredged  material  are  different  with  respect  to  toxicity  when  in 
fact  they  are  the  same  (false  positive  error)  and  2)  the  probability  of  stating  that  the  ref¬ 
erence  area  and  the  dredged  material  are  the  same,  when  in  fact  they  are  different  (false 
negative  error).  The  power  of  the  test  is  the  probability  of  finding  an  impact  when  it  oc¬ 
curs.  In  most  regulatory  applications  we  are  interested  only  in  a  single-direction  test  — 
whether  the  toxicity  of  the  dredged  material  is  greater  than  the  reference  (e.g.,  we  ignore 
any  information  showing  the  reference  toxicity  is  greater  than  the  dredged  area).  This 
type  of  statistical  approach  to  environmental  decision-making  achieves  the  goal  of  envi¬ 
ronmental  protection.  However,  some  problems  do  arise.  For  example,  if  enough  samples 
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are  taken,  the  reference  and  dredged  areas  can  always  be  shown  to  be  statistically  sepa¬ 
rable.  The  degree  of  separation  in  toxicity  can  be  very  small,  but  statistically  significant. 
We  could  frame  an  alternative  null  hypothesis  to  detect  a  difference  of  an  ecologically 
significant  magnitude.  While  this  has  scientific  appeal,  the  degree  of  difference  represent¬ 
ing  an  ecologically  significant  result  could  be  long  debated. 

In  classical  statistical  terms,  the  chance  of  false  positive  decisions  is  termed  a  Type  I  error 
(a),  the  chance  of  false  negative  decisions  is  considered  a  Type  II  error  (ft),  and  power  is 
1  -  ft.  Investigations  typically  focus  on  establishing  rigorous  Type  I  errors.  For  example, 
risk  managers  are  often  willing  to  risk  a  5%  chance  that  a  Type  I  error  occurs.  However, 
Type  II  errors  are  often  ignored,  or  no  definitive  Type  II  decision  criteria  are  established. 
In  classical  hypothesis  testing,  balancing  the  potential  occurrence  of  false  positive  and 
false  negative  results  is  a  function  of  the  number  of  samples  collected  and  the  variance  of 
the  sample  mean.  Type  I  and  Type  II  errors  are  mathematically  linked,  for  a  fixed  sample 
size  and  variance,  so  establishing  a  in  the  decision  criteria  determines  ft  (see  Steel  and 
Torrie  1980). 

The  interrelationship  of  Type  I  and  Type  II  errors  requires  consideration  of  the  relative 
importance  of  the  risks  associated  with  making  false  positive  and  false  negative  decisions. 
Risk  assessment  usually  focuses  on  reducing  the  risk  of  false  positive  results  associated 
with  statistical  analysis  (by  establishing  a  low  a  level).  However,  from  an  environmental 
protection  perspective,  emphasis  should  be  placed  on  reducing  the  risk  of  making  false 
negative  decisions  (/>.,  falsely  concluding  that  an  area  is  not  contaminated  when  it  actu¬ 
ally  is).  For  example,  an  environmentally  conservative  approach  would  emphasize  iden¬ 
tifying  small  differences  between  the  reference  and  test  areas.  Therefore,  it  would  be 
desirable  to  have  a  high  chance  of  classifying  a  site  as  clean  when  it  is  clean  (high  power), 
and  a  small  chance  of  falsely  classifying  the  reference  and  test  site  are  the  same  when  they 
are  different  (small  ft).  In  this  conservative  approach,  one  would  rather  make  an  error  in 
judging  a  clean  site  as  contaminated  than  in  misclassifying  a  contaminated  site  as  clean. 

In  contrast,  an  alternative  approach  would  be  to  classify  the  test  site  different  from  the 
reference  site  only  when  the  data  provide  a  large  degree  of  confidence  in  the  decision. 
Therefore,  one  would  want  to  reduce  the  error  in  stating  that  the  reference  site  and  test 
site  are  different  when  they  are  not.  This  is  accomplished  by  establishing  a  small  a,  with 
a  higher  probability  of  false  negative  results . 

17.3. 7. T  Case  study:  interlaboratory  variability 

The  chance  of  false  positive  and  false  negative  results  is  a  function  of  the  number  of 
samples  and  the  variance  of  the  test  endpoint.  As  an  example,  we  will  examine  variabil¬ 
ity  in  sediment  toxicity  tests.  While  many  sources  of  error  are  associated  with  these  tests 
(see  Sections  18.2  and  18.3),  the  following  example  focuses  only  on  uncertainty  associ¬ 
ated  with  interlaboratory  variability.  Interlaboratory  variance  (i.e.,  round-robin  or  ring 
testing)  has  been  extensively  studied  in  whole  effluent  toxicity  testing  (Parkhurst  et  al 
1992;  Warren-Hicks  and  Parkhurst  1992),  and  we  will  draw  on  this  earlier  work  in  this 
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analysis.  Data  for  the  analysis  are  obtained  from  a  round-robin  study  of  whole  sediment 
toxicity  tests  (USEPA  1994a;  ASTM  1995a-1995e;  Burton  etal  1996). 

A  key  issue  in  the  use  of  any  method  is  the  number  of  tests  required  for  a  specified  deci¬ 
sion  criteria.  In  this  example,  Burton  et  al  (1996)  reported  mean  survival  of  Chironomus 
tentans  in  10-d  whole  sediment  toxicity  tests.  Data  were  generated  from  eight  laborato¬ 
ries,  each  of  which  tested  split  samples  of  field-collected  sediment  using  the  toxicity  test 
method  described  in  USEPA  (1994a).  For  one  of  the  sediments  evaluated  in  the  study,  the 
mean  survival  among  the  eight  laboratories  was  76%  with  a  standard  deviation  of  27% 
(resulting  in  a  coefficient  of  variation  of  37%,  which  is  considered  acceptable 
interlaboratory  precision;  Burton  et  ah  1996).  The  data  consisted  of  survival  measure¬ 
ments  reported  by  each  of  eight  laboratories.  From  this  information,  the  number  of  labo¬ 
ratories  needed  to  achieve  a  specified  decision  criteria  can  be  estimated,  based  on 
pre-specified  probabilities  of  either  false  positive  or  false  negative  results.  For  any  one 
laboratory,  the  reported  survival  response  was  the  mean  of  8  replicate  tests,  each  repli¬ 
cate  test  consisting  of  10  organisms.  If  the  replicate  data  were  available,  we  would  have 
calculated  the  number  of  replicates  required  hy  each  of  the  eight  participating  laborato¬ 
ries  for  prespecified  decision  criteria  (the  data  were  not  available  at  the  time  of  this  analy¬ 
sis).  For  discussion  purposes  only,  we  use  the  laboratory  mean  data  and  present  an 
analysis  of  interlaboratory  variability.  The  method  for  estimating  intralaboratory  repli¬ 
cates  is  consistent  with  the  following  discussion. 

Suppose  that  an  investigator  is  faced  with  determining  the  sediment  toxicity  of  a  poten¬ 
tially  impacted  site.  Also,  assume  that  90%  survival  has  been  established  as  the  acceptable 
control  response.  Given  that  the  investigator  has  no  prior  knowledge  of  the  site  toxicity, 
an  assumption  was  made  that  the  sediment  is  about  as  toxic  as  that  in  the  above  refer¬ 
enced  Burton  et  al  (1996)  study.  Alternatively,  the  investigator  could  conduct  a  pilot 
study  of  the  site  instead  of  making  this  assumption.  Given  these  data,  the  investigator 
wishes  to  determine  the  number  of  laboratories  necessary  to  achieve  a  specified  decision 
criterion  based  on  the  chance  of  false  positive  and  negative  results.  (Note:  A  somewhat 
related  concept  is  minimum  detectable  difference  [MDD]  [USEPA  and  USAGE  1991, 
1994],  The  MDD  is  generally  used  to  establish  the  minimum  difference  detectable  be¬ 
tween  a  control  and  response  solution  for  a  single  toxicity  test,  given  a  fixed  sample  size 
and  variance.  This  concept  may  be  adaptable  to  our  example  by  evaluating  the  MDD 
between  a  reference  site  and  a  dredge  site,  for  a  fixed  number  of  laboratories  with  known 
interlaboratory  variance.) 

The  following  equation  provides  a  means  of  determining  the  desired  number  of  labora¬ 
tories,  while  balancing  the  chance  of  false  negative  and  false  positive  test  results: 


N  =a^ 


Cs-^i 
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where 

N  =  the  number  of  laboratories  required  to  meet  specified  levels  of  Type  I  and 
Type  II  errors, 

Zi  _  u  =  the  critical  value  of  1  -  for  the  normal  distribution  {e.g.,  1.64  for  a  1- 
sided  test  with  a  =  0.05  error  probability), 

Zi  =  the  critical  value  of  1  -  a  for  the  normal  distribution  {e.g.,  1.64  for  a  1- 
sided  test  with  an  a  =  0.05  error  probability), 

Cj  =  the  specified  standard  {i.e.,  90%  survival),  and 
\il  =  the  average  percent  survival  across  laboratories. 

Figure  17-3  presents  a  plot  of  the  power  of  a  1-sided  test  of  the  null  hypothesis 

Hq:  Cs  =  \ii 

against  the  alternative  hypothesis 

Cs  > 

(Note:  because  we  are  only  concerned  if  the  toxicity  of  the  site  is  less  than  the  standard, 
a  1-sided  test  of  the  hypothesis  is  appropriate.) 

Notice  that  fixing  either  the  power  of  the  test  (1  -  fi)  or  the  Type  I  error  rate  (a)  estab¬ 
lishes  the  other.  For  example,  with  a  false  positive  and  false  negative  error  probability  of 
5%,  33  laboratories  are  required  for  testing.  With  a  false  positive  and  false  negative  error 
rate  of  10%,  20  laboratories  are  required  for  testing. 

77.3.1.2  Summary 

The  above  example  demonstrates  a  method  for  estimating  the  number  of  laboratories, 
given  Type  I  and  Type  II  errors.  The  choice  of  how  much  error  is  acceptable  is  up  to  the 
investigator.  The  investigator  should  carefully  consider  the  relative  merits  and  interpre¬ 
tations  of  Type  I  and  Type  II  errors  when  evaluating  the  results  from  any  sediment  mea¬ 
surements  used  to  establish  the  possibility  of  contamination. 

17.3.2  Uncertainty  in  estimating  spatial  and  temporal  distributions  of 
contaminants  in  sediment 

Sediments  may  be  highly  variable  on  both  a  spatial  and  a  temporal  basis.  Therefore,  rep¬ 
licate  samples  need  to  be  collected  at  each  site  to  determine  variance  in  sediment  charac¬ 
teristics.  Sediment  should  be  collected  with  as  little  disruption  as  possible;  however, 
subsampling,  compositing,  or  homogenization  of  sediment  samples  may  be  required  for 
some  experimental  designs  {e.g.,  USEPA  1994a,  1994b;  ASTM  1995a-1995e;  Environ¬ 
ment  Canada  1996a,  1996b).  Sampling  locations  might  be  distributed  along  a  known 
pollution  gradient,  in  relation  to  the  boundary  of  a  disposal  area,  or  sampling  locations 
may  be  identified  as  being  contaminated  in  a  reconnaissance  survey.  These  comparisons 
can  be  made  in  both  space  and  time.  In  pre-dredging  studies,  a  sampling  design  can  be 
developed  to  assess  the  contamination  of  samples  representative  of  the  project  area  to  be 
dredged  (Chapter  5).  Such  a  design  should  include  subsampling  cores  taken  to  the 
project  depth. 
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Number  of  Laboratories 

Q-  Level:  -  oc  =  0.05  - a  =  0.10  - a  =  0.20 

Figure  17-3  Number  of  laboratories  as  a  function  of  Type  I  error  and  power 


When  dealing  with  a  given  sampling  area  {ix.,  river,  lake,  estuary),  the  appropriate  sam¬ 
pling  design  is  of  importance  since  the  goal  might  be  to  describe  existing  conditions  for 
the  entire  area  by  collecting  a  discrete  number  of  samples.  The  choice  of  the  sampling 
scheme  is  also  important  for  the  intended  data  manipulation.  Fewer  samples  are  needed 
if  the  objective  of  the  study  is  just  to  describe  the  average  conditions  over  the  entire  area. 
On  the  other  hand,  if  the  sampling  is  to  be  used  to  draw  a  map  of  distribution  (/>.,  high¬ 
light  point  sources,  trends  of  distribution,  location  and  area  of  contamination),  the 
choice  of  the  sampling  net  (regular,  random,  or  fixed  grid)  may  dictate  which  type  of 
mapping  system  has  to  be  used  (Baudo  1990).  In  addition,  if  temporal  variability  is  ex¬ 
pected,  sampling  should  be  repeated  as  many  times  as  possible  to  reduce  this  source  of 
uncertainty.  Sampling  frequency  is  particularly  critical  since  the  timing  of  the  successive 
samples  can  only  be  used  to  evaluate  the  change  on  the  selected  time  scale. 

77.3.2.1  Estimation  and  measurement  of  magnitude  of  uncertainty 

The  overall  uncertainty  of  sampling  depends  on  several  factors,  including  sample  1)  type, 
2)  volume,  3)  equipment,  4)  handling,  5)  number,  and  6)  replicates. 

Sample  type.  Sediment  is  a  complex  mixture  of  solid,  aqueous,  and  gaseous  phases,  in 
addition  to  biotic  compartments.  Hence,  study  objectives  must  clearly  indicate  which 
type  of  medium  is  to  be  sampled.  For  example,  different  methods  may  be  needed  to  sub¬ 
sample  pore  water  in  sediment  or  to  sample  benthic  organisms  in  sediment.  On  the  other 
hand,  the  objective  of  the  study  may  be  to  sample  the  whole  “active”  layer  (e.g.,  to  calcu¬ 
late  a  diversity  index)  or  sample  the  vertical  microstructure  of  some  sediment  character¬ 
istic  (e.g„  the  vertical  profiles  for  redox  or  oxygen).  Once  the  type  of  sample  required  has 
been  identified,  the  choice  of  the  sampling  gear  must  be  made  accordingly.  It  is  often 
difficult  to  determine  the  appropriate  depth  of  sediment  to  sample  (e.g.,  How  deep  must 
a  core  be?  How  deep  is  the  bioturbation?  Where  is  the  boundary  between  the  oxic  and 
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the  anoxic  layers?).  A  common  mistake  is  to  sample  at  the  maximum  depth  in  the  sedi¬ 
ment  which  will  potentially  provide  an  unrealistic  estimate  of  exposure  {i.e.,  sampling 
below  the  biologically  active  zone).  Finally,  the  performance  of  the  selected  sampling  gear 
is  not  constant  (depending  on  the  kind  of  substrate  and  the  operating  conditions,  includ¬ 
ing  skill  of  operators).  In  most  cases,  when  sampling  is  made  without  actually  seeing 
what  is  being  collected,  the  uncertainty  can  be  assessed  only  after  the  sample  is  recovered 
{e.g.,  via  visual  observation  of  core  length,  texture,  or  color)  or  processed  in  the  labora¬ 
tory.  As  a  consequence,  the  degree  of  uncertainty  is  different  when  sampling  is  done 
blind  or  is  done  visually  by  checking  the  performance  of  the  sampling  gear  (Tables  17-1 
and  17-2). 

Sample  volume.  Some  variables  may  exhibit  pseudo-continuous  distribution  in  space 
(e.g.,  grain  size  where  particles  are  sorted  according  to  the  hydraulics  of  the  system), 
whereas  other  variables  often  have  a  pronounced  patchiness  (e.g.,  benthic  invertebrate 
distributions).  Hence,  the  sample  volume  should  account  for  the  known  or  estimated 
local  micro-spatial  heterogeneity  (both  horizontal  and  vertical)  at  the  scale  of  the  sam¬ 
pling  tool.  Larger  samplers  will  provide  an  “averaged”  sample  with  an  increased  uncer¬ 
tainty  of  measuring  smaller-scale  heterogeneity.  If  variables  with  substantially  different 
distributions  have  to  be  measured,  repeated  sampling  with  different  samplers  should  be 
considered.  Subsampling  of  sediments  is  often  required;  therefore,  samples  should  be 
thoroughly  homogenized  before  splits  are  made  (USEPA  1994a;  ASTM  1995a).  The 
amount  of  sediment  required  for  analyses  can  range  from  a  few  micrograms  (e.g.,  carbon- 
hydrogen-nitrogen  [CHN]  analysis)  to  several  kilograms  (e.g,  radioactive  isotopes,  labo¬ 
ratory  toxicity  tests).  Hence,  the  minimum  amount  of  sample  needed  to  perform  all 
analyses  must  be  estimated  before  choosing  the  sampling  equipment  and  may  limit  the 
number  of  measurements  performed  on  a  sample.  For  example,  if  a  coring  tube  is  used  to 
sample  sediment,  the  diameter  and  the  thickness  of  sections  need  to  provide  enough 
material  for  all  of  the  planned  analyses  must  be  calculated.  The  same  sample  is  typically 
used  for  more  than  one  analysis.  Hence,  a  compromise  would  be  to  use  the  original  sec¬ 
tions  for  measurements  requiring  small  aliquots  and  to  use  pooled  sections  for  measure¬ 
ments  requiring  larger  aliquots.  It  should  be  noted  that  this  procedure  potentially  limits 
subsequent  statistical  analysis  of  data  (e.g,  correlations,  principal  component  analysis, 
cluster  analysis)  which  can  be  done  only  with  paired  data.  A  special  case  where  the 
sample  volume  is  particularly  critical  is  evaluating  porewater  composition  of  sediment. 
In  this  case,  the  water  content  of  the  sediment  must  be  estimated  in  advance  to  be  sure 
enough  water  can  be  extracted  from  each  sample. 

In  summary,  to  increase  confidence  and  lower  uncertainty,  the  sample  volume  must  be 
large  enough  to  provide  a  representative  sample  for  each  measurement  endpoint  of  inter¬ 
est,  enough  material  to  perform  all  analyses  plus  further  measurements  that  may  be 
needed  (i.e.,  TIE;  Chapter  16).  Often  these  two  requirements  are  difficult  to  achieve  due 
to  lack  of  knowledge  and  the  need  to  minimize  the  sampling  effort.  However,  it  should  be 
kept  in  mind  that  collection  of  additional  sample  material  will  result  in  a  different 
sample. 
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Table  17-1  Degree  of  uncertainty  in  blind  sampling 


Sample 

Knowledge 

Systematic  errors 

Random  errors 

Type 

L 

L 

L 

Volume 

L 

L 

M 

Equipment 

L 

H 

H 

(nonconventional) 

H 

H 

H 

Handling 

L 

L 

M 

Number 

H 

H 

H 

Repetitions 

H 

L 

H 

L=  low,  M  =medium,  H  =  high 

Table  17-2  Degree  of  uncertainty 

in  visual  sampling 

Sample 

Knowledge 

Systematic  errors 

Random  errors 

Type 

L 

L 

L 

Volume 

L 

L 

L 

Equipment 

L 

L 

L 

(nonconventional) 

H 

H 

H 

Handling 

L 

L 

M 

Number 

H 

L 

H 

Repetitions 

H 

L 

L 

L=  low,  M  =medium,  H  =  high 


Sampling  equipment  (dredges,  grabs,  carers).  All  types  of  sampling  equipment 
vary  in  performance,  and  each  device  has  specific  advantages  and  disadvantages  (i.e.,  size, 
weight,  triggering  mechanisms;  ASTM  1995b).  Few  comparisons  have  been  reported 
dealing  with  sampling  efficiency  between  types  of  equipment  (Baudo  1990);  however, 
different  types  of  dredges,  grabs,  and  corers  provide  unique  types  of  samples.  Thus, 
choice  of  equipment  for  both  whole-sediment  and  porewater  sampling  depends  on  the 
study  objectives,  measurement  endpoint  of  interest,  characteristics  of  the  study  area, 
sediment  type  and  compactness,  and  the  presence  of  interfering  flora  or  fauna  (e.g.,  roots, 
shells)  (Baudo  1990;  Adams  1991;  Mudroch  and  MacKnight  1991;  ASTM  1995b).  More¬ 
over,  allowance  should  be  made  for  the  different  performance  of  the  same  sampler  de¬ 
pending  on  the  environment  in  which  it  is  used  (e.g,  soft  bottom  or  sand).  The 
equipment  can  alter  or  contaminate  the  sample  (e.g.,  metal  or  plastic  in  contact  with 
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sample,  cleaning  of  the  sampler)  or  the  equipment  may  produce  artifacts  due  structural 
limitations  (e.g.,  washing  out  of  finer  material,  gas  or  temperature  changes).  Mudroch 
and  MacKnight  (1991)  and  ASTM  (1995b)  provide  additional  details  regarding  sampling 
equipment. 

In  addition  to  dredges,  grabs,  and  corers,  a  number  of  nonconventional  sampling  tools 
have  been  applied  for  specific  purposes,  including  peepers  (dialysis  chambers  for  collec¬ 
tion  of  pore  water),  sedimentation  traps,  and  artificial  substrates  (for  benthic  coloniza¬ 
tion  studies).  Although  these  and  other  nonconventional  tools  may  provide  useful 
information,  lack  of  standardization  may  lead  to  a  high  uncertainty  with  their  use.  To 
summarize,  any  sampling  equipment  may  introduce  a  marked  uncertainty  since  it  is 
largely  unknown  whether  a  representative  sample  has  been  collected. 

Sample  handling.  Techniques  of  sample  conservation  and  manipulation  should  be 
carefully  examined  using  specific  equipment  to  prevent  not  only  the  contamination  of 
the  samples  but  also  possible  alterations  (Mudroch  and  Bourbonniere  1991;  ASTM 
1995b).  Alteration  of  the  sediment  usually  remains  undetected  unless  specific  studies  and 
designs  are  used  (e.g.,  repeated  measures  at  different  times  or  after  each  sampling  step). 
Hence,  identification  of  new  handling  artifacts  may  make  results  of  previous  studies 
questionable.  In  order  to  minimize  uncertainty,  a  consensus  method  should  be  estab¬ 
lished  and  followed;  however,  periodic  revisions  may  be  required. 

Number  of  samples.  Switzer  (1975)  pointed  out  a  common  statistical  problem:  the  es¬ 
timation  of  the  required  number  of  sampling  points  can  be  determined  statistically  only 
after  the  data  have  been  gathered,  or  if  some  estimate  of  crucial  sediment  characteristics 
are  obtained  in  preliminary  studies.  A  number  of  approaches  can  be  used  to  estimate  the 
minimum  number  of  samples  needed  to  estimate  an  average  value  for  the  measurement 
of  interest  (Baudo  1990).  Information  needed  to  determine  the  number  of  samples  in¬ 
cludes  heterogeneity  in  physical  and  chemical  data  (Kratochvil  and  Taylor  1981;  Sokal 
and  Rohlf  1981;  Hakansson  and  Jansson  1983).  The  required  number  of  samples  also 
depends  on  the  distribution  of  the  data  (normal,  Poisson,  negative  binomial  distribu¬ 
tions).  A  detailed  description  of  the  statistical  properties  of  these  distributions  can  be 
found  elsewhere  (Bliss  and  Fisher  1953;  Sokal  and  Rohlf  1981). 

For  the  definitive  sampling  of  an  area,  additional  considerations  including  directionality 
and  point  sources  may  dictate  the  sampling  plan.  The  sampling  strategies  in  these  cases 
can  be  classified  in  three  main  types  (Hakanson  and  Jansson  1981;  Baudo  1990):  1)  a 
deterministic  system,  with  a  sampling  design  based  on  previous  information  and  varying 
density;  2)  a  stochastic  system,  when  the  sampling  stations  are  randomly  selected;  and  3) 
a  regular  grid  system,  with  the  sampling  stations  randomly  or  deterministically  selected. 
Advantages  and  disadvantages  of  each  method  are  discussed  in  Baudo  (1990). 

Repetitive  sampling.  More  often  than  not,  only  one  sample  per  station  is  collected. 
Repetitive  sampling  allows  for  an  estimate  of  the  local  spatial  or  temporal  heterogeneity 
(USEPA  1994a,  1994b;  ASTM  1995a-1995e).  An  accurate  estimation  of  the  sampling 
variability  (within  the  station  and  among  stations)  is  needed  to  avoid  false  positive  or 
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false  negative  decisions  and  assumes  an  even  greater  importance  when  assessing  spatial 
or  temporal  variability.  The  obvious  disadvantages  of  repeated  sampling  are  the  in¬ 
creased  cost  and  time  although  the  increased  costs  of  collecting  two  or  more  samples 
from  the  same  station  can  be  quite  low,  especially  if  a  multi-sampler  can  be  used.  How¬ 
ever,  it  will  be  much  more  expensive  to  perform  all  of  the  analyses  on  each  sample.  Alter¬ 
natively,  a  representative  measure  could  be  made  on  all  sample  pairs  to  evaluate  local 
heterogeneity  {i.e.,  CHN  analysis  which  may  be  related  to  distributions  of  metals  and 
organic  contaminants). 

An  extrapolation  to  biological  variables  may  be  more  difficult.  Samples  for  biological 
measures  are  often  pooled  to  provide  an  average  of  the  local  variability.  For  example,  if 
the  purpose  of  the  study  is  to  conduct  a  reconnaissance  survey  to  identify  contaminated 
areas  for  further  investigation,  the  experimental  design  might  include  collection  of  just 
one  composited  sample  from  each  area  to  allow  for  sampling  a  larger  area.  The  lack  of 
replication  at  an  area  usually  precludes  statistical  comparisons  {e.g.,  ANOVA),  but  these 
surveys  can  be  used  to  identify  contaminated  areas  for  further  study  or  these  data  can  be 
evaluated  using  statistical  regressions  (USEPA  1994a,  1994b;  ASTM  1995a).  In  other  in¬ 
stances,  the  purpose  of  the  study  might  be  to  conduct  a  quantitative  sediment  survey  to 
determine  statistically  significant  differences  between  control  (or  reference)  sediments 
and  test  sediments  from  one  or  more  areas.  The  number  of  replicates  per  site  should  be 
based  on  the  need  for  sensitivity  or  power.  In  a  quantitative  survey,  field  replicates  (sepa¬ 
rate  samples  from  different  grabs  collected  at  the  same  area)  would  need  to  be  taken  at 
each  site.  Separate  subsamples  from  the  field  replicates  might  be  used  to  determine 
within-sample  variability  or  for  comparisons  of  test  procedures  {e.g.,  comparative  sensi¬ 
tivity  among  test  organisms),  but  these  subsamples  cannot  be  considered  true  field  rep¬ 
licates  for  statistical  comparisons  among  areas. 

17,3,2.2  Accounting  for  and  reducing  uncertainty 

MacKnight  (1991)  identified  the  following  factors  as  most  important  to  identifying  sam¬ 
pling  options;  1)  purpose  of  sampling,  2)  study  objectives,  3)  historical  data  and  other 
available  information,  4)  bottom  dynamics  at  the  sampling  area,  5)  size  of  the  sampling 
area,  and  6)  available  funds  versus  estimated  (real)  cost  of  the  project.  Factors  1  and  2  are 
obviously  critical  and  must  be  agreed  upon  in  advance  between  managers  and  scientists 
involved  in  the  project.  This  could  easily  be  the  most  important  source  of  uncertainty, 
since  “there  is  no  one  formula  for  design  of  a  sediment  sampling  pattern  which  would  be 
applicable  to  all  sediment  sampling  programs”  (MacKnight  1991).  Since  inadequate 
strategies  and  unclear  goals  of  sediment  sampling  are  among  the  most  important  sources 
of  uncertainty,  the  early  involvement  of  managers  in  the  sampling  plan  should  be  sought. 

In  addition,  to  reducing  the  sampling  bias,  the  proposed  project  should  be  peer  reviewed 
by  scientists  with  expertise  in  each  one  of  the  fields  of  study  covered  by  the  project.  This 
peer  review  should  evaluate  the  adequacy  of  the  sample  media,  sample  volume,  sampling 
equipment,  sample  handling,  sample  number,  and  repetitions.  Information  related  to 
Factors  3  (historic  data)  and  4  (dynamics  of  bottom  sediment)  listed  above  are  often  not 
available  to  assist  in  the  selection  of  sampling  areas  (number  and  location)  with  the  re- 
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quired  degree  of  confidence.  This  information  is  needed  to  assure  an  unbiased  assess¬ 
ment  of  Factor  5  (size  of  the  sampling  area)  and  Factor  6  (costs).  In  any  case,  the  choice 
of  sampling  plan  should  support  a  statistical  evaluation  of  the  sampling  variability  (both 
spatial  and  temporal).  Hence,  a  pilot  study  is  usually  needed  to  establish  local  spatial  and 
temporal  heterogeneity  before  the  definitive  sampling  is  performed.  Whenever  feasible, 
in  situ  tools  should  be  used  in  the  pilot  study  to  estimate  distributions  in  relevant  physi¬ 
cal,  chemical,  and  biological  measurements  {e.g.,  sediment  compactness,  echo-sounding, 
pH,  oxygen,  redox,  biotic  communities).  The  extrapolation  to  field  conditions  from  data 
gathered  on  samples  transferred  to  the  laboratory  is  always  subject  to  uncertainty.  For 
this  reason,  there  is  a  need  for  developing  in  situ  techniques  for  measuring  chemistry, 
toxicity  testing,  and  benthic  community.  Alternatively,  uncertainty  associated  with  sam¬ 
pling  could  be  substantially  lowered  by  visually  checking  the  sampling  sites  {e.g.,  by  us¬ 
ing  divers,  submersible  cameras,  manned  vehicles;  Tables  17-1  and  17-2). 

77.3.2.3  Interpretation  of  uncertainty  in  relation  to  decision-making 

Assuming  all  samples  are  used  for  analysis,  the  number  of  samples  determine  the  cost  of 
the  study.  Hence,  there  is  a  need  to  limit  the  number  as  much  as  possible  and  still  retain 
confidence  that  the  final  results  will  be  both  sound  and  defensible.  Too  few  samples  will 
result  in  large  variability  and  may  result  in  the  need  for  additional  sampling,  whereas  too 
many  samples  will  result  in  a  waste  of  resources.  On  the  other  hand,  the  uncertainty  as¬ 
sociated  with  measurement  endpoints  outlined  in  Chapter  18  should  be  weighed  relative 
to  the  cost-benefit  analysis  for  the  potential  remedial  options.  An  overestimate  of  the 
actual  contamination,  both  in  terms  of  concentration  and  the  distribution  in  the  study 
area,  may  lead  to  an  inflated  cost  for  remediation  and  for  disposal.  An  underestimate 
could  result  in  a  wrong  choice,  for  example,  no  remedial  action,  improperly  managed 
disposal,  or  limited  intervention. 

77.3.2.4  Summary 

Uncertainty  in  sampling  is  typically  unaccounted  for  in  most  risk  assessments.  This  un¬ 
certainty  can  result  from  either  poor  knowledge  of  the  real  performance  of  the  samplers 
or  from  an  inadequately  designed  sampling  plan.  In  addition,  both  systematic  and  ran¬ 
dom  errors  can  occur  and  usually  remain  unknown  or  undetected.  Furthermore,  the 
uncertainty  associated  with  sampling  is  much  higher  in  cases  when  the  actual  sampling 
is  done  under  “blind”  conditions  (i.e.,  sampling  from  a  boat;  Table  17-1)  compared  to 
when  the  operators  can  see  the  sampling  medium  (shallow  areas  sampled  by  hand  or 
with  visual  aids;  Table  17-2).  The  uncertainty  in  describing  temporal  variability  is  usually 
greater  since  it  compounds  the  uncertainty  in  spatial  heterogeneity  with  uncertainty  in 
repeated  sampling.  Tables  17-1  and  17-2  provide  an  indication  of  overall  uncertainty  in 
sampling,  and  assuming  that  standardized  procedures  are  followed,  the  reliability  of  the 
selected  sampling  method  is  considered,  and  allowances  are  made  for  systematic  or  ran¬ 
dom  errors.  Sampling  uncertainty  has  components  which  can  be  reduced  by  1)  planning 
the  sampling  according  to  the  existing  information,  2)  using  appropriate  collection  and 
handling  methods,  3)  conducting  pilot  studies,  and  4)  measuring  the  different  sources  of 
variability  in  the  definitive  study. 
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17.3.3  Error  and  uncertainty  in  models  applied  in  sediment  ecological 
risk  assessments 

Exposure  models  are  used  to  predict  concentrations  of  contaminants  in  the  physical  en¬ 
vironment  (sediments,  pore  water,  and  overlying  water)  and  concentrations  in  organisms 
as  a  function  of  space  and  time  (Chapter  15).  Two  types  of  system-level  models  are  typi¬ 
cally  applied  to  evaluate  contaminated  sediments;  1)  contaminant  transport  and  fate 
models  and  2)  food  chain  models.  Sections  18.5  and  18.6  (Chapter  18)  discuss  uncer¬ 
tainty  associated  with  specific  models  used  to  evaluate  bioavailability  of  contaminants 
associated  with  sediments.  System-level  models  should  be  able  to  predict  contaminant 
concentrations  as  affected  by  natural  or  anthropogenic  events.  The  goal  of  modeling 
should  be  to  develop  a  predictive  model  (i.e.,  a  model  that  is  based  on  parameters  which 
can  be  measured  accurately  in  the  laboratory  or  by  means  of  simple  field  tests).  Compu¬ 
tations  should  then  be  based  on  these  parameters  with,  ideally,  no  calibration  or  fine- 
tuning.  In  this  way,  one  can  have  confidence  in  the  predictions  and  can  also  use  the 
model  to  evaluate  different  environmental  conditions  and  different  systems,  again  with 
little  or  no  additional  calibrating. 

17.3.3.  T  Model  calibration 

When  a  model  is  calibrated,  the  calibration  is  only  valid  for  the  data  used  in  the  calibra¬ 
tion.  In  order  to  illustrate  this  point,  consider  predicting  contaminant  concentrations  in 
a  lake  over  25  years.  During  this  time,  a  few  large  storms  can  occur  and  these  storms  may 
be  responsible  for  most  of  the  sediment  and  contaminant  transport.  If  the  model  is  cali¬ 
brated  to  data  taken  in  an  “average”  year  (i.e.,  when  the  large  storms  did  not  occur),  then 
parameters  and  results  will  be  incorrect  because  they  did  not  include  extreme  events.  If, 
on  the  other  hand,  the  model  is  calibrated  to  data  taken  in  a  very  stormy  time,  the  param¬ 
eters  and  extrapolated  results  will  also  be  incorrect.  A  predictive  model  needs  to  be  based 
on  the  concept  that  the  future  is  statistically  the  same  as  the  past.  What  is  known  is  only 
that  events  of  a  certain  magnitude  have  a  certain  probability  of  occurring.  Predictive 
models  should  then  be  able  to  predict  the  most  probable  result  and  the  probabilities  of 
the  results  of  different  sequence  of  events. 

As  an  example  of  the  effect  of  different  input  parameters  on  the  variability  of  model  out¬ 
put,  consider  the  prediction  of  PCB  half-life  in  Lake  Ontario  made  by  three  independent 
groups  (Limnotech,  Manhattan  College,  and  University  of  Toronto).  All  of  these  models 
are  based  on  the  concept  of  a  well-mixed  sediment  layer  (Table  17-3;  Ziegler  and  Connolly 
1995).  The  effect  of  this  layer  on  sediment  fluxes  is  dependent  on  the  thickness  of  the 
layer  (typically  a  poorly  defined  characteristic).  Because  of  this,  estimates  of  half-life  differ 
by  almost  one  order  of  magnitude,  from  3  years  to  25  years. 

17.3.3.2  Errors  and  uncertainties  in  contaminant  transport-’and^fate 
process  models 

In  order  to  quantitatively  understand  and  predict  the  environmental  effects  of  contami¬ 
nated  sediments,  especially  as  influenced  by  natural  large  episodic  events  or  by  remedial 
actions,  a  knowledge  of  the  transport  and  fate  of  the  sediments  and  the  contaminants 
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Table  17-3  Results  of  three  different  predictions  of  half-life  of  PCBs  in  Lake  Ontario 


Investigators 

Assumed  thickness  of  layer 

PCB  half-life 

Limnotech 

15  cm 

25yrs 

Manhattan  College 

8  cm 

15yrs 

University  of  Toronto 

0.5  cm 

3yrs 

Source:  Ziegler  and  Connolly  (1995). 


associated  with  these  sediments  is  necessary  (Chapter  15).  Some  of  the  more  significant 
processes  that  need  to  be  understood  and  quantified  include  1)  the  resuspension,  ero¬ 
sion,  and  sediment  bed  dynamics;  2)  sorption  of  contaminants  to  particles  and  colloids; 
3)  flocculation,  settling  speeds,  and  deposition  rates  of  particles  and  floes;  4)  hydrody¬ 
namics,  including  currents  and  wave  action;  5)  air-water  exchange  of  contaminants;  6) 
biochemical  reactions  and  degradation  of  contaminants;  and  7)  the  inputs  of  contami¬ 
nants  from  the  surrounding  land,  atmospheric,  and  point  discharges. 

The  process  most  relevant  to  estimates  of  contaminant  fluxes  at  the  sediment-water  inter¬ 
face  are  processes  1  to  4  listed  above.  These  sediment-water  exchange  processes  are  im¬ 
portant  because  these  factors  control  phase  distributions,  bioavailability,  and 
contaminant  concentrations  in  the  sediments  and  overlying  water  (Chapter  15).  The  flux 
of  contaminants  to  surface  waters  from  the  surrounding  land  resulting  from  nonpoint 
discharges  is  also  not  well  quantified.  Point  discharges  are  generally  better  known  and 
controlled. 


17.3.3.3  Relevance  to  decision-making 

In  some  form  or  another,  models  are  always  used  in  organizing  and  interpreting  data 
and,  therefore,  in  decision-making.  These  models  may  be  simple  conceptual  models  or 
they  may  be  complex  models  involving  many  physical,  chemical,  and  biological  processes 
described  by  large  numbers  of  differential  equations.  The  solutions  to  these  models  may 
be  simple  estimates  or  large  arrays  of  numbers.  Models  should  help  in  making  decisions 
(c.g.,  selection  of  remedial  options  and  understanding  the  effects  of  these  remedial  ac¬ 
tions).  Because  of  errors  in  models,  more  complex  models  are  not  necessarily  more  accu¬ 
rate  or  more  helpful  than  simple  models.  However,  simple  models  are  often  based  on 
larger  numbers  of  assumptions  and  may  thus  be  inaccurate.  Although  complex  models 
may  address  these  assumptions  more  appropriately,  they  require  more  input  informa¬ 
tion,  which  makes  them  less  useful.  In  some  cases,  complex  models  may  have  more  as¬ 
sumptions  (U.,  input  parameters)  which,  if  not  verified,  lead  to  higher  uncertainty.  In  all 
modeling,  the  potential  errors  of  the  model  should  be  understood  and  quantified.  This 
information  needs  to  be  transmitted  to  the  risk  manager  both  at  the  problem  formula¬ 
tion  stage  and  at  the  risk  management  stage  (Figure  1-1).  Because  of  natural  variability, 
models  should  give  the  most  probable  outcome  of  a  sequence  of  events. 
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17.4  Conclusions  and  recommendations 

•  Guidelines  have  been  developed  for  conducting  ERAs  to  promote  consistency  in 
design,  analysis,  and  interpretation  of  data.  These  guidelines  also  allow  for  the 
establishment  of  quality  standards  and  consistent  terminology  for  assessments. 
Consistency  in  the  use  of  guidelines  can  help  inform  all  stakeholders  as  to  the 
relative  degree  of  confidence  and  scientific  knowledge  under  which  a  decision 
was  made.  Additionally,  the  best  way  to  reduce  initial  uncertainties  in  the  risk 
assessment  is  to  involve  all  interested  parties  through  stakeholder  input  and 
expert  opinion  surveys.  This  involves  asking  all  interested  parties  (including  the 
risk  managers,  the  scientific  community,  and  the  public)  to  define  the  problem 
in  a  form  of  a  concise  narrative.  Once  the  problem  has  been  identified,  appro¬ 
priate  assessment  and  measurement  endpoints  can  be  selected  and  applied. 

•  Sources  of  uncertainty  unique  to  characterizing  exposure  or  effects  in  SERAs 
include  1)  sample  location,  collection,  and  handling  errors;  2)  spatial  and 
temporal  heterogeneity  of  the  stressor;  3)  references  for  comparison  of  stressor 
levels;  4)  substrate  type  and  interactions  with  stressor;  5)  nonequilibrium  of 
chemical  stressor  between  sediment  and  water;  6)  effects  of  noncontaminant 
stressors  in  toxicity  tests;  and  7)  laboratory-to-field  extrapolations. 

•  Risk  characterization  requires  consideration  of  the  relative  importance  of  the 
errors  associated  with  making  both  false  positive  and  false  negative  determina¬ 
tions.  Risk  assessment  usually  focuses  on  reducing  the  risk  of  false  positive 
results  associated  with  statistical  analysis  (by  establishing  a  low  a  level). 
However,  from  an  environmental  protection  perspective,  emphasis  should  also 
be  placed  on  reducing  the  risk  of  making  false  negative  decisions  {i.e.,  falsely 
concluding  that  an  area  is  not  contaminated  when  it  actually  is). 

•  Sediment  sampling  has  uncertainty  components  which  can  be  reduced  by 
planning  the  sampling  program  according  to  the  existing  information,  using 
appropriate  collection  and  handling  methods,  conducting  pilot  studies,  and 
measuring  the  different  sources  of  variability  in  the  definitive  study.  Addition¬ 
ally,  uncertainty  associated  with  sampling  of  sediment  can  be  substantially 
lowered  by  visually  checking  the  sampling  sites. 

•  Exposure  models  are  used  to  predict  concentrations  of  contaminants  in  the 
physical  environment  and  concentrations  in  organisms  as  a  function  of  space 
and  time.  The  models  should  be  predictable  of  contaminant  concentrations  as 
affected  by  natural  or  anthropogenic  events.  The  goal  of  modeling  should  be  to 
develop  a  predictive  model  {i.e.,  a  model  which  is  based  on  parameters  which 
can  be  measured  accurately  in  the  laboratory  or  by  means  of  simple  field  tests). 
Computations  should  then  be  based  on  these  parameters  with,  ideally,  no 
calibration  or  fine-tuning.  In  this  way,  one  can  have  confidence  in  the  predic¬ 
tions  and  can  also  use  the  model  to  evaluate  different  environmental  conditions 
and  different  systems,  again  with  little  or  no  additional  calibration. 
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•  The  outcome  of  all  risk  management  actions  should  either  be  the  acceptance  or 
the  reduction  of  the  risk.  Risk  reduction  involves  many  potential  actions,  which 
range  from  technical  through  socioeconomic  to  the  political.  In  undertaking 
risk  management,  it  is  necessary  to  decide  which  risks  must  be  managed  and  in 
what  priority. 

•  Strategic  actions  for  addressing  uncertainty  in  SERAs  are  listed  in  Section  18.8. 
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18.1  Introduction 

Earlier  chapters  have  recommended  procedures  for  conducting  SERAs  related  to  product 
safety  assessments  (Chapters  3  and  4),  navigational  dredging  decisions  (Chapter  5), 
cleanup  decisions  (Chapters  6  and  7),  and  general  issues  related  to  evaluation  of  uncer¬ 
tainty  associated  with  conducting  SERAs.  This  chapter  describes  the  uncertainty  associ¬ 
ated  with  specific  measurement  endpoints  commonly  used  in  SERAs  and  discusses 
approaches  for  addressing  these  sources  of  uncertainty.  These  stepwise  procedures  pro¬ 
vide  explicit  guidance  on  the  four  major  elements  of  ERAs:  problem  formulation,  expo¬ 
sure  assessment,  effect  assessment,  and  risk  characterization  (Chapter  1).  In  addition, 
several  types  of  measurement  endpoints  were  identified  at  the  workshop  which  have 
been  used  to  conduct  SERAs  including  1)  toxicity  tests  (both  the  fraction  tested  [Section 
18.2]  and  the  endpoints  selected  [Section  18.3]),  2)  benthic  invertebrate  assessments 
(Section  18.4),  3)  bioaccumulation  assessments  (Section  18.5),  4)  sediment  chemistry 
(Section  18.6),  and  5)  SQGs  (Section  18.7). 

Although  each  type  of  measurement  endpoint  may  contribute  to  risk-based  decision¬ 
making,  several  important  questions  remain  regarding  uncertainty.  For  example,  “Which 
toxicity  tests  provide  the  most  sensitive  and  realistic  measurements  for  evaluating  effects 
of  contaminants  on  benthic  communities?  Which  measurement  endpoints  for  benthic 
invertebrate  assessments  provide  the  best  linkage  to  sediment  contamination?  Which 
SQGs  should  be  used  to  evaluate  the  potential  biological  significance  of  sediment-associ¬ 
ated  chemicals  at  contaminated  areas?”  To  answer  these  and  other  similar  questions, 
information  is  needed  on  the  uncertainty  that  is  associated  with  each  of  the  measurement 
endpoints  that  are  commonly  used  in  SERAs.  It  would  be  ideal  to  quantify  the  absolute 
uncertainty  associated  with  each  measurement;  however,  such  an  assessment  is  currently 
not  possible.  Nonetheless,  it  is  possible  to  evaluate  the  relative  uncertainty  associated 
with  some  of  the  measurement  endpoints  commonly  used  in  SERAs. 

To  facilitate  this  evaluation,  we  established  a  series  of  criteria  to  support  consistent  as¬ 
sessments  of  the  uncertainty  associated  with  each  measurement  endpoint,  including  1) 
precision,  2)  ecological  relevance,  3)  causality,  4)  sensitivity,  5)  interferences,  6)  standard- 
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ization,  7)  discrimination,  8)  bioavailability,  and  9)  field  validation.  Each  of  these  criteria 
were  defined  in  the  context  of  the  type  of  measurement  endpoint  that  was  considered. 
For  example,  the  definitions  of  precision  are  different  for  the  evaluations  of  sediment 
chemistry  data  in  Section  18.6  compared  to  the  definition  for  precision  in  Section  18.7 
dealing  with  SQGs.  In  this  way,  the  relevance  of  the  evaluation  criteria  to  each  assessment 
was  assured.  Although  no  attempt  was  made  in  this  evaluation  to  recommend  specific 
measurements  to  apply  when  conducting  risk  assessments  with  sediment,  it  is  antici¬ 
pated  that  the  information  provided  will  allow  the  reader  to  identify  the  measurement 
endpoints  which  are  most  relevant  for  their  specific  applications.  Uncertainty  in  mea¬ 
surement  endpoints  associated  with  lack  of  knowledge  is  indicated  with  an  asterisk  in 
Tables  18-1  to  18-3  and  Tables  18-5  to  18-7  to  differentiate  from  systematic  uncertainty 
which  can  be  rectified  (methodologically )  or  quantified  (sampling  decisions  and  design). 

The  fact  that  no  recommendations  were  made  with  regard  to  a  single  preferred  endpoint 
reflects  a  lack  of  scientific  consensus  about  whether  any  single  approach  provides  the  best 
answer  for  risk  assessment.  There  was  a  consensus  that  each  endpoint  had  strengths  and 
weaknesses,  many  of  them  inherent,  and  that  the  best  approach  at  present  is  probably  to 
use  multiple  endpoints.  Implementation  of  robust  multiple  endpoint  studies  remains  in 
the  experimental  phase.  The  success  of  multidisciplinary  tools  such  as  the  sediment  qual¬ 
ity  triad  will  expand  upon  these  applications.  The  discussions  of  each  endpoint  in  the 
following  sections  were  developed  by  specialists  in  that  field  participating  at  the  Work¬ 
shop,  with  inputs  from  colleagues  in  other  specialties.  Perhaps  it  is  a  sign  of  the  relative 
immaturity  of  the  field  of  SERA  that  specialists  tended  to  be  more  optimistic  about  the 
uncertainties  associated  with  the  endpoints  in  their  area  of  specialty  and  sometimes  more 
critical  of  alternative  endpoints.  Rankings  in  Tables  18-1  to  18-3  and  Tables  18-5  to  18-7 
reflect  the  preferences  of  the  specialists  in  that  field.  Comparisons  of  rankings  are  valid 
within  endpoints  (e.g.,  within  Table  18-1)  but  not  necessarily  among  endpoints  (e.g.,  be¬ 
tween  Table  18-1  and  18-5).  As  this  field  matures,  it  is  expected  that  interrelationships 
among  specialists  will  grow  in  sophistication.  For  example,  the  results  of  future  commu¬ 
nity  assessments  may  identify  more  sensitive  species  for  laboratory  toxicity  testing,  the 
bioaccumulation  approaches  may  be  used  to  help  understand  the  complexities  of  natu¬ 
ral  exposures  at  a  site,  and  the  results  of  the  laboratory  toxicity  tests  could  be  used  to  help 
explain  changes  in  populations  or  communities.  The  paucity  of  knowledge  about  such 
interrelationships  is  an  example  of  uncertainties  associated  with  inadequate  knowledge 
that  are  broader  than  those  measured  by  the  approaches  discussed  in  Chapter  17.  The 

for  the  caution 


18.2  Evaluation  of  uncertainty  associated  with  phases  used 
to  conduct  laboratory  toxicity  tests  with  sediments 

Various  methods  have  been  developed  to  evaluate  sediment  toxicity.  These  procedures 
range  in  complexity  from  short-term  lethality  tests  which  measure  effects  of  individual 
contaminants  on  single  species  to  long-term  tests  which  determine  the  effects  of  chemi- 


knowledge  uncertainties  are  difficult  to  quantify,  but  they  are  the  reason 
that  many  scientists  express  about  overly  broad  applications  of  SERAs. 
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Table  18-1  Uncertainty  associated  with  sediment  phases  used  in  laboratory  toxicity  tests 


Evaluation  criteria 

Whole 

sediment: 

benthos 

Whole 

sediment: 

pelagic 

Organic 

extracts 

Suspended 

solids 

Elutriates 

Pore  water 

Precision 

1 

1 

1 

3 

1 

1 

Ecological  relevance 

1 

2 

3 

2 

3 

2 

Causality:  link 

3 

3 

3 

3 

3 

3 

Causality:  source 

1 

2 

3* 

3 

3 

2 

Sensitivity 

1 

2 

3 

3 

3 

2 

Interference 

2* 

2 

3 

3 

3 

2* 

Standardization 

1 

2 

3 

3 

1 

2 

Discrimination 

1* 

1* 

1* 

1* 

1* 

1* 

Bioavailability 

1* 

1* 

3 

1* 

3 

1* 

Field  validation 

1* 

2* 

3 

3* 

3 

3* 

1  =  low  uncertainty  (good) 
3  =  high  (bad) 

*  =  lack  of  knowledge 


cal  mixtures  on  the  growth  or  reproduction  of  test  organisms  or  structure  and  function 
of  communities.  The  test  organisms  might  include  bacteria,  algae,  macrophytes,  fishes, 
and  benthic,  epibenthic  and  pelagic  invertebrates  (Lamberson  and  Swartz  1988;  Burton 
1992).  Discussions  of  uncertainty  relative  to  laboratory  toxicity  tests  were  divided  into 
two  operational  categories:  1)  uncertainties  related  to  the  phase  tested  (in  this  section) 
and  2)  uncertainties  related  to  the  selection  of  endpoints  measured  in  toxicity  tests  (Sec¬ 
tion  18.3).  A  diverse  array  of  exposure  phases  have  been  used  in  sediment  toxicity  tests 
(Ankley,  Schubauer-Berigan,  and  Dierkes  1991;  Burton  1992;  Ingersoll  1995).  The 
present  discussion  of  uncertainty  focuses  on  six  principal  phases  typically  evaluated  in 
toxicity  tests:  1)  whole  sediment  using  benthic  invertebrates,  2)  whole  sediment  using 
pelagic  organisms,  3)  organic  extracts  of  whole  sediment,  4)  suspended  solids,  5)  elutri¬ 
ates,  and  6)  pore  water  isolated  fi:om  whole  sediment.  Within  the  category  of  whole  sedi¬ 
ment  tests,  we  chose  to  differentiate  between  tests  conducted  with  benthic  versus  pelagic 
organisms  because  of  uncertainty  with  respect  to  route  of  exposure  with  pelagic  organ¬ 
isms. 

Whole  sediment  toxicity  tests  were  developed  to  evaluate  the  effects  of  in-place  sediments 
(USEPA  1994a,  1994b;  ASTM  1995a-1995e;  Environment  Canada  1996a,  1996b).  Toxic¬ 
ity  tests  with  porewater  samples  isolated  from  sediment  were  developed  for  evaluating 
the  potential  in  situ  effects  of  contaminated  sediment  on  aquatic  organisms  (Ankley, 
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Phipps,  et  al.  1991;  Ankley,  Schubauer-Berigan,  and  Dierkes  1991).  For  many  benthic 
invertebrates,  the  toxicity  and  bioaccumulation  of  sediment-associated  contaminants 
such  as  metals  and  nonionic  organic  contaminants  have  been  correlated  with  concentra¬ 
tions  of  these  chemicals  in  pore  water  (Di  Toro  et  al  1991).  Toxicity  tests  with  organic 
extracts  were  developed  to  evaluate  effects  of  the  maximum  concentrations  of  organic 
contaminants  associated  with  a  sediment  (Chapman  and  Fink  1984).  Tests  with  elutriate 
samples  and  suspended  solids  measure  the  potential  release  of  contaminants  from  sedi¬ 
ment  to  the  water  column  during  disposal  of  dredged  material  or  during  sediment  resus¬ 
pension  events  (Shuba  et  al  1978;  ASTM  1995d). 

Each  of  these  phases  is  evaluated  in  relation  to  the  following  major  sources  of  uncertainty: 
precision,  ecological  relevance,  causality,  sensitivity,  interference,  standardization,  dis¬ 
crimination,  bioavailability,  and  field  validation  (Table  18-1).  The  uncertainty  associated 
with  each  phase  is  a  function  of  inherent  limitations  (e.g. ,  testing  of  whole  sediments  has 
greater  ecological  significance  than  organic  extracts)  and  the  stage  of  development  of  the 
response  as  a  toxicological  endpoint  {e.g,  whole  sediment  tests  are  much  better  devel¬ 
oped  than  porewater  tests).  The  available  information  does  not  support  a  quantitative 
evaluation  of  the  uncertainty  associated  with  each  of  the  sediment  phases  considered  in 
this  assessment.  For  this  reason,  an  evaluation  of  the  relative  uncertainty  of  various  sedi¬ 
ment  phases  was  conducted  to  provide  risk  assessors  with  guidance  for  the  application  of 
appropriate  toxicity  tests. 

18.2.1  Precision 

Precision  was  evaluated  in  terms  of  the  replicability  of  the  tests  commonly  performed  on 
various  sediment  phases.  With  respect  to  laboratory  precision  {le.,  precision  not  related 
to  sample  collection,  handling,  and  storage),  tests  with  most  of  the  fractions  listed  in 
Table  18-1  have  a  low  degree  of  uncertainty.  Round-robin  (ring)  tests  conducted  with 
whole  sediments  indicate  relatively  low  intra-  and  interlaboratory  variability  (Mearns  et 
al  1986,  USEPA  1994a,  1994b;  ASTM  1995a;  Burton  al  1996).  The  one  exception 
would  be  suspended-solids  tests,  where  the  workgroup  noted  the  methods  used  for  gen¬ 
erating  these  types  of  exposure  are  quite  variable  (ASTM  1995d). 

18.2.2  Ecological  relevance 

With  respect  to  the  evaluation  of  sediment  toxicity  tests  conducted  on  various  sediment 
phases,  ecological  relevance  was  evaluated  in  terms  of  its  linkage  to  the  receptors  which 
are  to  be  protected.  Whole  sediment  tests  using  resident  species  were  considered  to  pro¬ 
vide  the  most  realistic  information  for  assessing  organism  responses  (Table  18-1).  Be¬ 
cause  organic  extracts  may  alter  the  bioavailability  of  sediment-associated  contaminants, 
toxicity  tests  conducted  using  this  phase  were  considered  to  have  a  relatively  lower  level 
of  relevance.  Similarly,  elutriate  and  suspended-solids  tests  are  conducted  using  a  phase 
which  may  artificially  alter  the  availability  of  contaminants.  Although  water  column  spe¬ 
cies  have  been  demonstrated  to  be  sensitive  indicators  of  whole  sediment  toxicity  (Bur¬ 
ton  et  al  1992),  benthic  species  in  direct  contact  with  sediment  provide  a  more  direct 
assessment  of  sediment  contamination  (Ankley,  Schubauer-Berigan,  and  Dierkes  1991). 
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The  most  ecologically  relevant  test  systems  from  the  standpoint  of  uncertainty  were 
whole  sediment  tests  with  benthic  organisms,  followed  by  whole  sediment  tests  with 
pelagic  species,  suspended  sediment  tests,  and  porewater  tests  (Table  18-1).  All  of  these 
phases  represent  meaningful  and  interpretable  routes  of  exposure  with  respect  to  aquatic 
organisms,  albeit  for  slightly  different  reasons.  For  example,  pore  water  seems  to  be  a 
reasonable  surrogate  test  fraction  for  whole  sediments  (Giesy  and  Hoke  1989;  Ankley, 
Phipps,  et  al  1991;  Ankley,  Schubauer-Berigan,  and  Dierkes  1991;  Carr  and  Chapman 
1995),  whereas  whole  sediment  tests  with  pelagic  species  seem  reasonable  for  assessing 
the  potential  toxicity  of  contaminants  released  from  sediments  into  the  overlying  water 
(Burton  et  al  1992;  ASTM  1995a).  The  least  ecologically  relevant  test  systems  were  or¬ 
ganic  extracts  and  elutriates.  A  fraction  of  the  organic  extracts  are  comprised  of  contami¬ 
nants  that  are  not  (or  ever  would  be)  bioavailable  and  represent  an  arbitrary  situation  not 
expected  to  exist  in  the  environment.  There  is  inherent  variability  to  studies  that  have 
used  elutriates  as  predictive  of  potential  toxicity  of  sediments  in  situ.  In  terms  of  interre¬ 
lationships,  it  was  recognized  that  most  toxicity  tests  are  closed,  simplified  systems.  As 
such  they  may  have  difficulty  simulating  the  effects  of  contaminants  within  complex  food 
webs,  feedback  loops,  and  other  interactions  that  characterize  natural  ecosystems,  no 
matter  what  sediment  phase  is  employed  (Luoma  and  Ho  1993). 

18.2.3  Causality 

Determination  of  causality  {i.e.,  correctly  identifying  stressors)  is  of  fiindamental  impor¬ 
tance  in  SERAs.  For  this  reason,  uncertainty  in  the  phases  evaluated  in  toxicity  tests  was 
evaluated  in  terms  of  their  ability  to  determine  or  predict  the  factors  causing  adverse  ef¬ 
fects  in  contaminated  sediments.  With  respect  to  causality,  it  is  highly  uncertain  exactly 
which  contaminants  may  be  causing  observed  toxicity  without  the  use  of  sediment  chem¬ 
istry  data,  irrespective  of  the  system  tested.  Because  toxicity,  in  and  of  itself,  consists  only 
of  a  generic  biological  response,  no  test  with  the  phases  listed  in  Table  18-1  can  indicate 
specific  contaminants  of  concern.  For  this  interpretation,  it  is  necessary  to  link  the  toxic¬ 
ity  test  to  appropriate  mixture  toxicity  models,  spiked-sediment  tests,  or  TIE  procedures 
designed  specifically  for  defining  specific  compounds  or  classes  of  compounds  respon¬ 
sible  for  toxicity  (Chapter  16). 

On  another  level,  toxicity  tests  can  be  related  to  causality  in  a  more  generic  sense.  For 
example,  if  an  effect  is  observed  with  a  measure  of  benthic  community  structure,  appro¬ 
priate  toxicity  tests  can  be  used  to  indicate,  with  a  good  deal  of  certainty,  causality  in 
terms  of  the  field  response  resulting  from  toxicity  as  opposed  to  other  environmental 
stressors.  Also,  the  spatial  distribution  of  toxicity  can  indicate  the  location  of  sources  of 
contaminants  which  cause  adverse  responses.  In  these  instances,  whole  sediment  tests 
with  benthic  species  have  a  relatively  high  degree  of  certainty,  while  whole  sediment  tests 
with  pelagic  species  and  porewater  tests  have  lower  certainty.  The  test  systems  with  the 
greatest  uncertainty  in  this  category  included  organic  extracts,  elutriate,  and  suspended- 
solid  tests.  In  the  case  of  the  organic  extract  and  elutriate  tests,  the  exposure  regimes  do 
not  represent  a  realistic  measure  of  in  situ  conditions  of  sediment.  The  workgroup  was 
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unaware  of  any  evidence  that  has  been  collected  to  evaluate  this  issue  for  suspended-solid 
tests. 

18.2.4  Sensitivity 

Sensitivity  is  another  important  characteristic  of  measurement  endpoints  because  there 
is  a  need  to  reliably  identify  sediments  that  have  the  potential  to  affect  sensitive  species 
in  aquatic  ecosystems.  Sensitivity  should  minimize  uncertainty  associated  with  both  false 
positives  (nontoxic  sample  incorrectly  classified  as  toxic)  and  false  negatives  (toxic 
sample  incorrectly  classified  as  nontoxic).  Whole  sediment  tests  with  benthic  organisms 
typically  exhibit  an  appropriate  level  of  sensitivity.  This  is  not  to  say  that  all  toxicity  tests 
conducted  with  benthic  organisms  are  the  most  sensitive,  but  that  there  was  a  high  de¬ 
gree  of  certainty  that  the  response  measured  was  predictive  and  accurate  of  effects  of 
contaminants  in  sediment  {i.e.,  minimize  both  false  positives  and  false  negatives).  How¬ 
ever,  these  conclusions  are  based  primarily  on  results  of  toxicity  tests  which  monitor  only 
short-term  effects  {i.e.,  10-d  survival).  Additional  emphasis  should  be  placed  on  develop¬ 
ing  and  evaluating  chronic  toxicity  test  methods  {i.e.,  longer  exposures  measuring  effects 
on  growth  and  reproduction;  Kemble  et  al  1994;  Benoit  et  al  1996;  Sibley  et  al  1996). 
There  was  not  a  consensus  in  the  group  that  some  of  the  subtle,  but  important,  effects 
which  might  occur  in  nature  were  measured  with  a  high  degree  of  sensitivity  in  the  whole 
sediment  bioassays,  even  though  they  were  often  the  best  choice  of  sediment  phase.  An 
intermediate  level  of  uncertainty  was  assigned  to  the  sensitivity  of  whole  sediment  tests 
with  pelagic  species  and  tests  conducted  with  porewater  samples.  The  highest  relative 
uncertainty  is  associated  with  test  systems  consisting  of  organic  extracts,  elutriates,  and 
suspended  sediments.  Relative  sensitivity  is  highly  dependent  on  the  species  selected  for 
testing  in  addition  to  the  phase  tested.  It  is  recommended  that  additional  studies  be  con¬ 
ducted  comparing  the  sensitivity  of  single  species  tested  in  a  variety  of  sediment  phases 
to  the  response  of  populations  and  communities  in  the  field. 

18.2.5  Interferences 

Interferences  are  considered  to  be  related  to  biotic  or  abiotic  factors  which  could  influ¬ 
ence  toxicity  test  results  beyond  the  direct  effects  of  specific  contaminants.  That  is,  inter¬ 
ferences  are  considered  to  be  nontreatment  factors  {i.e.,  noncontaminant  effects).  All  of 
the  test  systems  have  moderate  to  high  uncertainty  with  regard  to  interferences.  Factors 
such  as  particle  size,  OC  content,  salinity,  BOD,  and  the  presence  (or  absence)  of  nutri¬ 
ents  can  potentially  affect  the  results  of  whole  sediment  tests  with  both  benthic  and  pe¬ 
lagic  species  (DeWitt  et  al.  1988, 1989;  Ankley,  Benoit,  et  al  1994;  Suedel  and  Rodgers 
1994).  In  the  case  of  organic  extracts,  sample  integrity  is  so  disrupted  that  observed  tox¬ 
icity  could  be  solely  artifactual  {e.g.,  resulting  from  the  presence  of  solvent).  An  important 
factor  which  can  cause  uncertainty  with  regard  to  tests  with  suspended  sediments  is  the 
differentiation  of  toxicity  caused  by  contaminants  from  that  caused  by  physical  effects  of 
suspended  solids  (Chapman  et  al  1987).  A  major  source  of  uncertainty  in  the  interpreta¬ 
tion  of  aqueous  phase  (elutriate,  pore  water)  sediment  tests  is  pH  drift  which  can  change 
by  one  or  more  units  (usually  upward)  over  the  course  of  tests  with  these  fractions.  Such 
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changes  can  alter  the  toxicity  of  common  sediment  contaminants  such  as  metals,  hydro¬ 
gen  sulfide,  and  ammonia  (Ankley  and  Thomas  1992;  Ankley  and  Schubauer-Berigan 
1995).  If  aqueous  phases  are  removed  from  partially  anoxic  sediments,  rapid  precipita¬ 
tion  of  ferrous  iron  in  the  oxidized  conditions  of  the  bioassay  will  remove  metals  and  bias 
toxicity  downward  (Luoma  1995). 

18.2.6  Standardization 

In  the  context  of  toxicity  tests,  standardization  was  evaluated  in  terms  of  the  level  of  peer 
review  and  the  publication  of  standard  methods.  There  is  little  uncertainty  related  to  the 
availability  of  standard  methods  for  preparing  and  testing  sediments  with  benthic  spe¬ 
cies  or  with  elutriates  (ASTM  1995a-1995e;  USEPA  1994a,  1994b;  Environment  Canada 
1996a,  1996b).  A  moderate  amount  of  uncertainty  exists  with  regard  to  standard  meth¬ 
ods  for  testing  pelagic  species  in  whole  sediment  tests  (ASTM  1995a)  or  for  isolating  and 
testing  pore  water  (ASTM  1995b).  However,  several  recently  published  studies  may  aid 
in  the  standardization  of  porewater  preparation  for  toxicity  testing  (Schults  et  al  1992; 
Ankley  and  Schubauer-Berigan  1995;  Carr  and  Chapman  1995).  ASTM  has  recently  de¬ 
scribed  general  guidance  for  conducting  tests  with  suspended  solids  (ASTM  199 5d).  To 
our  knowledge,  there  are  no  standardized  (or  widely  accepted)  methods  for  testing  or¬ 
ganic  extracts  prepared  from  sediments. 

18.2.7  Discrimination 

This  criterion  was  intended  to  evaluate  the  ability  of  a  sediment  phase  used  in  a  toxicity 
test  to  discriminate  between  contaminated  sediments  by  providing  a  graded  response. 
All  six  types  of  test  fractions  have  the  potential  for  low  uncertainty  with  respect  to  dis¬ 
criminating  among  samples  {i.e.,  the  ability  to  distinguish  among  samples  along  a  stres¬ 
sor  gradient).  However,  it  should  be  noted  that  the  ability  to  discriminate  among  samples 
is  more  a  function  of  the  biological  endpoint  chosen  than  of  the  sample  fraction  tested 
(Chapter  16). 

18.2.8  Bioavailabilily 

Information  from  a  variety  of  sources  indicates  that  the  characteristics  of  the  sediments 
and  the  associated  pore  water  can  influence  the  bioavailability  and  toxicity  of  sediment- 
associated  contaminants  (Pavlou  1987;  Di  Toro  et  al  1990;  Swartz  et  al  1990).  For  this 
reason,  toxicity  tests  that  explicitly  considered  test  phases  that  can  be  used  to  evaluate  the 
influence  of  these  factors  have  a  lower  level  of  uncertainty  than  those  which  did  not  ad¬ 
dress  bioavailability  (Di  Toro  etal  1991).  Toxicity  tests  cannot  be  used  alone  to  evaluate 
the  bioavailability  of  contaminants.  Synoptic  generation  of  appropriate  chemistry  data  is 
needed  in  combination  with  the  toxicity  test  (Chapter  14).  If  this  is  done,  the  question  of 
uncertainty  is  then  related  to  the  relevance  of  the  test  from  a  standpoint  of  realistic  expo¬ 
sure  {le.,  ecological  relevance).  Thus,  tests  with  the  highest  degree  of  uncertainty  are 
those  with  a  questionable  formulation  from  an  environmental  standpoint  (the  solvent 
extract  and  elutriates).  The  other  four  test  phases  —  whole  sediment  tests  with  benthic 
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and  pelagic  species,  suspended  solids,  and  porewater  tests  —  all  have  low  uncertainty 
with  respect  to  assessing  the  biological  availability  of  contaminants  in  different  scenarios. 

18.2.9  Field  validation 

Field  validation  is  an  essential  element  of  toxicity  testing  because  it  provides  a  basis  for 
assessing  reliability  in  sediment  quality  assessments.  In  this  context,  toxicity  tests  were 
considered  to  have  low  uncertainty  if  they  were  shown  to  be  predictive  of  responses  of 
benthic  communities  in  the  field.  None  of  the  phases  listed  in  Table  18-1  has  received 
enough  attention  from  the  standpoint  of  field  validation;  further  research  is  required  in 
all  instances.  However,  some  data  do  exist  to  suggest  that  certain  whole-sediment  tests 
can  be  predictive  of  population-  and  community-level  responses  in  the  field  (Swartz  etal 
1994;  Canfield  et  al.  1994, 1996;  Chapter  8).  Therefore,  the  general  consensus  was  that 
there  was  a  reasonable  amount  of  certainty  with  respect  to  this  extrapolation.  Similarly, 
good  correspondence  was  observed  between  the  results  of  whole  sediment  toxicity  tests 
with  pelagic  cladocerans  and  measures  of  benthic  community  structure  in  several  Great 
Lakes  tributaries  (Indiana  Harbor,  Buffalo  River,  Saginaw  River;  Burton  and  Ingersoll 
1994).  Others  in  the  workgroup  suggested  it  may  be  more  difficult  to  simulate  the  com¬ 
plexities  of  nature  in  simple  whole-sediment  tests  and  pointed  out  examples  where  tox¬ 
icity  tests  did  not  predict  outcomes  in  nature  (Schindler  1987;  Bryan  and  Gibbs  1991; 
Luoma  and  Carter  1991).  The  degree  of  uncertainty  associated  with  laboratory-to-field 
extrapolation  of  biological  tests  with  organic  extracts  of  sediments,  suspended  sedi¬ 
ments,  elutriates,  and  pore  water  have  not  been  thoroughly  examined. 

18.2.10  Summary 

Whole  sediment  tests  were  considered  to  provide  the  most  realistic  phase  for  assessing 
organism  response.  Because  organic  extracts  may  alter  the  bioavailability  of  sediment- 
associated  contaminants,  toxicity  tests  conducted  using  this  phase  were  considered  to 
have  a  relatively  lower  level  of  relevance.  Similarly,  elutriate  and  suspended-solids  tests 
are  conducted  using  a  phase  which  may  artificially  alter  the  availability  of  contaminants. 
In  order  to  establish  cause-and-effect  relationships,  it  is  necessary  to  link  the  toxicity  test 
to  appropriate  mixture  toxicity  models,  spiked-sediment  tests,  or  TIE  procedures  de¬ 
signed  specifically  for  identifying  specific  compounds  or  classes  of  compounds  respon¬ 
sible  for  toxicity  (Chapter  16). 

18.3  Evaluation  of  the  uncertainty  associated  with  endpoints 
measured  in  laboratory  toxicity  tests  with  sediments 

A  diverse  array  of  response  endpoints  have  been  measured  in  sediment  toxicity  tests 
(Lamberson  and  Swartz  1988;  Burton  et  al  1992;  Lamberson  et  al  1992;  Burton  and 
Ingersoll  1994).  Uncertainty  associated  with  toxicity  endpoints  was  evaluated  rather  than 
assessing  the  types  of  species  that  could  be  tested  in  laboratory  exposures.  Others  have 
addressed  the  advantages  and  disadvantages  of  specific  tests  or  specific  organisms  used 
in  sediment  studies  {e.g.,  Swartz  1989;  Burton  1992;  Ankley,  Collyard,  etal  1994;  USEPA 
1994a,  1994b;  Ingersoll  1995).  The  emphasis  on  endpoints  is  not  intended  to  imply  that 
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any  particular  endpoint  is  preferable  over  another  with  respect  to  ecological  relevance  or 
overall  sensitivity.  For  example,  a  “chronic”  test  with  sublethal  endpoints  with  a  relatively 
insensitive  species  (e.g.,  growth  or  reproduction  with  polychaetes)  might  be  far  less  eco¬ 
logically  relevant  or  sensitive  compared  to  an  acute  test  with  survival  as  the  primary  end¬ 
point  with  a  sensitive  species  (e.g.,  survival  of  amphipods). 

The  present  discussion  of  uncertainty  focuses  on  seven  principal  classes  of  response  end¬ 
points  that  are  often  measured  in  toxicity  tests,  including  survival,  growth,  reproduction, 
behavior,  life  tables,  development,  and  biomarkers.  Each  of  the  response  endpoints  is 
evaluated  in  relation  to  the  major  sources  of  uncertainty:  precision,  ecological  relevance, 
causality,  sensitivity,  interference,  standardization,  discrimination,  bioavailability,  and 
field  validation  (Table  18-2).  The  uncertainties  associated  with  each  of  the  endpoints  are 
a  function  of  inherent  limitations  (e.g.,  reproduction  has  greater  ecological  significance 
than  biomarkers)  and  the  stage  of  development  of  the  response  as  a  toxicological  end¬ 
point  (e.g.,  acute  lethality  tests  are  much  better  developed  than  chronic  reproductive 
tests).  Uncertainty  also  varies  among  specific  toxicity  tests  that  use  the  same  endpoint 
(e.g.,  interference  of  low  salinity  on  amphipod  survival  is  substantial  ior  Rhepoxynius  abro- 
nius  but  insignificant  for  Eohaustorius  estuarius;  ASTM  1995c).  The  following  comparison 
of  uncertainty  is  based  on  the  best  developed  examples  of  toxicity  tests  for  each  of  the 
response  endpoints. 

18.3.1  Precision 

Precision  was  evaluated  relative  to  the  replicability  of  endpoint  responses.  The  lowest 
level  of  uncertainty  was  assigned  to  survival,  growth,  behavior,  and  development.  With 
respect  to  the  behavior  and  development  endpoints,  our  evaluation  of  uncertainty  was 
based  on  the  following  tests:  emergence  and  reburial  of  estuarine  and  marine  amphipods 
(ASTM  1995b),  48-h  development  of  echinoderm  and  mollusc  larvae  (USEPA 1991),  and 
emergence  of  midges  (ASTM  1995a;  Benoit  etal  1996;  Sibley  etal  1996).  Other  behav¬ 
ioral  and  developmental  endpoints  may  be  much  more  variable  than  the  measures  evalu¬ 
ated  in  this  exercise.  The  precision  of  measurements  of  survival,  growth,  behavior,  and 
development  is  sufficient  to  allow  for  adequate  statistical  power  with  relatively  few  rep¬ 
licates.  For  example,  comparisons  (n=365)  of  mean  survival  of  the  amphipod  Rhepoxynius 
abronius  in  treatment  and  control  means  of  sediments  with  five  replicates  showed  that 
differences  of  25%  or  more  were  always  statistically  significant  (Mearns  etal.  1986).  High 
uncertainty  in  precision  was  assigned  to  life  table  measurements  and  biomarkers  because 
of  the  inherent  variability  in  these  endpoints  (Benson  and  Di  Giulio  1992;  DeWitt  et  al. 
1992).  See  USEPA  (1994a,  1994b),  ASTM  (1995a),  and  Burton  etal.  (1996)  for  additional 
discussion  of  statistical  power  associated  with  sediment  toxicity  tests. 

18.3.2  Ecological  relevance 

Ecological  relevance  was  evaluated  in  terms  of  the  linkage  between  response  endpoints 
and  the  ecological  resources  to  be  protected.  Survival  is  a  highly  relevant  endpoint  for 
measuring  both  acute  and  chronic  effects.  For  example,  survival  of  amphipods  exposed 
to  field-collected  sediments  in  10-  to  28-d  toxicity  tests  has  been  correlated  with  abun- 
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Table  18-2  Uncertainty  associated  with  endpoints  measured  in  laboratory  toxicity  tests  with  sediment 


Evaluation  criteria 

Survival 

Growth 

Repro¬ 

duction 

Behavior 

Life  tables 

Develop¬ 

ment 

Biomarkers 

Precision 

1 

1* 

2* 

1* 

3* 

1 

3* 

Ecological  relevance 

1 

2* 

1* 

2* 

1* 

2* 

3* 

Causality;  link 

3 

3 

3 

3 

3 

3 

2 

Causality:  source 

1 

2* 

2* 

2* 

3* 

1 

2 

Sensitivity 

1 

2 

1 

2 

2* 

2 

1* 

Interference 

1* 

2* 

3* 

2* 

3* 

2* 

3 

Standardization 

1 

2 

2 

1 

3 

2* 

3 

Discrimination 

2 

1 

1 

2 

2* 

1 

2* 

Bioavailability 

1 

1 

1 

1 

1 

1 

1 

Field  validation 

1 

2* 

2* 

1 

3* 

2 

3 

1  =  low  uncertainty  (good) 
3  =  high  (bad) 

*  =  lack  of  knowledge 


dance  of  amphipods,  species  richness,  and  other  measures  of  community  structure  in  the 
field  (Ferraro  et  al  1991;  Schlekat  et  al  1994;  Swartz  et  al  1994),  Also,  growth  of  the  am- 
phipod  Hyalella  azteca  or  the  midge  Chironomus  tentans  have  been  correlated  to  measures 
of  community  structure  ( Wentsel  et  al  1977;  Giesy  et  al  1988;  Burton  and  Ingersoll  1994; 
Canfield  et  al  1994, 1996).  In  contrast,  longer  term  studies  (several  generations)  show 
how  amphipod  populations  can  suffer  eventual  extinction  at  toxicant  levels  below  those 
that  affect  survival  (Sundelin  1983),  and  several  authors  have  discussed  the  challenges  of 
predicting  the  more  subtle  effects  of  toxicants  in  nature  with  short-term  tests  of  survival 
(Schindler  1987;  Luoma  1995).  Reproduction  and  population  dynamics  (life  tables)  are 
relevant  measures  of  chronic  effects.  However,  substantial  uncertainly  is  associated  with 
many  existing  biomarkers  relative  to  ecological  responses  (Benson  and  Di  Giulio  1992) 
because  biomarkers  often  reflect  exposure  to  contaminants  rather  than  adverse  effects. 
More  study  is  needed  to  determine  if  changes  in  population  dynamics  and  biomarker- 
type  responses  precede  more  relevant  ecological  responses  as  warning  signals  (Schindler 
1987;  Widdows  and  Donkin  1989;  Luoma  1995). 

18.3.3  Causality 

Uncertainty  associated  with  causal  relations  in  toxicity  endpoints  was  evaluated  in  two 
ways.  First,  causality  was  assessed  with  respect  to  the  correct  linkage  to  the  specific  stress 
factors  which  caused  the  toxicological  response.  Second,  causality  was  evaluated  with 
respect  to  the  location  of  the  source  of  the  stress  factors.  Determination  of  causal  linkages 
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(i.e.,  correctly  identifying  stressors)  can  be  of  fundamental  importance  in  SERAs.  How¬ 
ever,  toxicological  responses  are  usually  generic  indicators  of  stress  and  cannot,  by  them¬ 
selves,  discriminate  specific  factors  which  caused  the  response.  This  limitation  is  the 
principal  justification  for  including  all  three  elements  of  the  sediment  quality  triad  {Le., 
biology,  chemistry,  toxicity)  in  comprehensive  SERAs  (Long  and  Chapman  1985; 
Chapman  et  al  1992).  Except  for  biomarkers,  all  of  the  response  endpoints  were  assigned 
a  high  degree  of  uncertainty  with  respect  to  Enkages  (Table  18-2).  Some  biomarkers  have 
less  uncertainty  because  they  can  be  used  to  identify  specific  classes  of  contaminants  {e.g., 
metallothionein  and  certain  metals;  cytochrome  P-450  induction  and  certain  organic 
compounds;  Brown  etal  1985;  Benson  and  Di  Giulio  1992;  Huggett,  Kimerle,  etal  1992; 
Huggett,  Van  Veld,  et  al  1992).  The  different  measurement  endpoints  can  be  used  to 
identify  source  location  as  a  reflection  of  spatial  distribution  of  toxicity.  This  use  of  the 
endpoints  listed  in  Table  18-2  to  identify  sources  responsible  for  effects  is  best  developed 
for  survival,  growth,  and  development  endpoints  (Chapman  et  al  1991;  Swartz  et  al 
1994;  Canfield  et  al  1994).  Higher  uncertainty  is  associated  with  use  of  life  table  measure¬ 
ments  primarily  because  of  a  lack  of  application  of  these  analyses  to  evaluate  contami¬ 
nated  sediment. 

18.3.4  Sensitmly 

Sensitivity  is  an  important  characteristic  of  risk  assessment  measurement  endpoints 
because  there  is  a  need  to  reliably  identify  sediments  with  high,  moderate,  and  low  prob¬ 
abilities  of  causing  adverse  biological  effects.  Therefore,  endpoints  were  considered  to 
have  a  relatively  low  level  of  uncertainty  in  this  respect  if  they  reliably  determine  effects 
at  relatively  low  contaminant  concentrations  {le.,  minimize  false  negatives,  although 
allowing  a  higher  probabiUty  of  false  positives).  All  of  the  endpoints  have  low  to  moder¬ 
ate  uncertainty  with  respect  to  sensitivity.  However,  the  sensitivity  of  biomarkers  may  not 
be  ecologically  relevant  (Benson  and  Di  Giulio  1992).  Furthermore,  measurement  of 
chronic  effects  does  not  always  equate  with  increased  sensitivity.  For  example,  measure¬ 
ments  of  reproduction  in  the  oligochaete  Lumbriculus  variegatus  is  typically  less  sensitive 
than  measurements  of  acute  10-d  lethality  with  the  amphipod  HyaleUa  azteca  (Phipps  et 
al  1995).  Sensitivity  tends  to  be  species  and  contaminant-specific.  Use  of  multiple  test 
species  is,  therefore,  recommended  for  SERAs.  It  should  also  be  noted  that  in  at  least 
some  of  the  cases  where  effects  of  moderate  contamination  have  been  thoroughly  stud¬ 
ied,  ecosystem  changes  may  occur  at  contaminant  levels  below  those  which  had  effects  in 
laboratory  toxicity  or  bioaccumulation  tests  (Schindler  1987;  Bryan  and  Gibbs  1991).  So 
the  uncertainties  inherent  in  the  toxicity  testing  approach  may  introduce  insensitivities 
which  will  not  be  quantifiable  until  responses  of  organisms  and  communities  to  sedi¬ 
ment  contamination  in  nature  are  better  understood  or  until  better  chronic  testing  meth¬ 
ods  are  developed. 

18.3.5  Interferences 

Interferences  are  biotic  or  abiotic  factors  which  could  influence  response  endpoints  be¬ 
yond  the  direct  effects  of  specific  contaminants.  That  is,  interferences  are  considered  to 
be  not-treatment  factors.  Many  natural  sediment  features  (grain  size,  salinity,  tempera- 
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ture,  hardness)  can  affect  responses  if  they  exceed  the  tolerance  limits  of  the  test  species 
(DeWitt  et  al  1988, 1989;  Ankley,  Benoit,  et  al  1994;  Kohn  et  al  1994). 

The  potential  for  interferences  often  increases  with  the  duration  of  the  exposure.  Thus, 
uncertainty  with  respect  to  interference  is  lowest  for  survival  in  acute  exposures  {i.e., 
10-d  tests).  Interference  may  be  greater  in  studies  of  chronic  effects  on  reproduction  and 
population  dynamics.  The  presence  of  indigenous  organisms  can  confound  chronic  end¬ 
points  {i.e.,  growth;  Reynoldson  et  al.  1994).  Some  biomarker  responses  {e.g.,  stress  pro¬ 
teins)  may  be  associated  with  sediment  features  or  other  noncontaminant  factors 
(Benson  and  Di  Giulio  1992).  Ultraviolet  light  causes  the  photoactivation  and  increased 
toxicity  of  some  PAHs  (Ankley,  Collyard,  et  al.  1994).  Thus,  the  absence  of  ultraviolet 
(UV)  light  in  the  fluorescent  lighting  typically  used  in  laboratories  is  an  interference  that 
may  result  in  an  underestimation  of  potential  toxicity  under  natural  conditions.  Interfer¬ 
ences  can  be  minimized  through  the  selection  of  appropriate  methods  and  adherence  to 
rigorous  quality  assurance  plans.  Of  particular  importance  is  the  selection  of  a  test  spe¬ 
cies  that  would  tolerate  the  test  sediments  in  the  absence  of  contaminants. 

18.3.6  Standardization 

Uncertainty  in  the  degree  of  standardization  of  response  endpoints  was  evaluated  on  the 
basis  of  appropriate  peer  review.  That  is,  the  degree  of  uncertainty  was  considered  to  be 
lower  for  those  approaches  that  have  been  published  in  the  peer-reviewed  literature  and 
in  standard  methods.  Standard  methods  have  been  developed  for  virtually  all  of  the  end¬ 
points  listed  in  Table  18-2  except  for  life  tables  and  biomarkers.  However,  ASTM  recently 
convened  a  symposium  dedicated  to  biomarkers  and  established  a  subcommittee  to  de¬ 
velop  standard  guidance.  Standard  methods  for  acute  toxicity  tests  have  been  developed 
by  a  variety  of  organizations  (ASTM  1995a-1995e;  USEPA  1994a,  1994b;  Environment 
Canada  1992, 1996a,  1996b)  and  standard  sediment  methods  are  being  developed  for 
the  chronic  endpoints  (DeWitt  et  al  1992;  ASTM  1995a;  Benoit  et  al  1996;  Sibley  et  al 
1996). 

18.3.7  Discrimination 

Discrimination  was  evaluated  in  terms  of  the  potential  for  response  endpoints  to  define 
the  degree  of  toxicity  associated  with  contaminated  sediments.  Chronic  endpoint  re¬ 
sponses  have  the  potential  to  be  more  discriminatory  than  survival.  Survival  often  ap¬ 
proximates  a  binary  (all  or  none)  response  and  may  not  distinguish  between  marginally 
contaminated  samples.  However,  growth,  reproduction,  and  population  dynamics  may 
be  able  to  discriminate  among  nonlethal  sediments  (Kemble  etal  1994).  Biomarkers  and 
behavior  have  greater  uncertainty  because  of  limited  applications  with  contaminated 
sediments.  Discrimination  of  the  relative  contamination  and  toxicity  of  sediments  that 
cause  complete  mortality  can  be  accomplished  by  testing  a  series  of  dilution  mixtures  of 
the  toxic  sediment  with  clean  (negative  control)  sediment  (Swartz  etal  1989).  However, 
dilution  of  contaminated  sediments  with  a  clean  sediment  may  result  in  artifacts  with 
respect  to  concentration-response  relationships  (Nelson  etal  1993). 
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18.3.8  Bioavailability 

Both  chemical  and  toxicological  data  are  needed  to  establish  bioavailability  of  sediment 
contaminants.  In  sediment  spiking  experiments,  there  is  low  uncertainty  about  bioavail¬ 
ability  when  a  biological  response  results  from  sediment  exposure  if  1)  the  contaminant 
and  response  data  are  correlated,  2)  an  appropriate  experimental  design  is  used,  and  3) 
quality  assurance  objectives  are  met.  The  bioavailability  of  contaminants  in  field-col¬ 
lected  sediments  is  less  certain  because  of  the  difficulty  of  establishing  causal  relations 
between  response  endpoints  and  specific  chemicals  (Chapter  16).  Toxicity  tests  that  in¬ 
corporate  TIE  procedures  can  be  used  to  determine  the  bioavailability  of  specific  chemi¬ 
cals  (Chapter  16). 

18.3.9  Field  validation 

Field  validation  of  endpoints  is  essential  to  reduce  laboratory  to  field  extrapolation  error. 
Uncertainty  in  field  validation  is  lowest  for  survival  tests.  For  example,  field-collected 
sediments  which  kill  amphipods  in  laboratory  tests  have  been  shown  to  be  associated 
with  areas  where  amphipods  are  rare  or  absent  (Ferraro  et  al.  1991;  Swartz  et  ah  1994). 
Chronic  endpoints  have  also  been  field  validated  to  a  lesser  degree  (DeWitt  et  al  1992; 
Canfield  et  al  1994;  Chapter  8).  The  life  table  assessment  is  a  promising  endpoint,  but 
there  has  been  little  opportunity  to  date  for  field  validation.  The  field  validation  of  biom¬ 
arkers  in  relation  to  adverse  biological  effects  has  been  equivocal.  As  methods  for  evalu¬ 
ating  effects  on  communities  become  more  sophisticated,  rigorous  and  quantitative 
comparisons  with  toxicity  endpoints  will  become  possible.  Such  comparisons  should 
reduce  the  uncertainties  which  lead  to  the  caution  many  ecologists  feel  about  compari¬ 
sons  with  simple  measures  of  communities  (Diaz  1992). 

18.3.10  Summary 

The  uncertainty  associated  with  survival  is  less  than  that  of  the  other  endpoints  used  in 
sediment  toxicity  tests  (Table  18-2).  This  is  because  mortality  is  an  extreme  response  with 
obvious  biological  consequences.  Also,  a  substantial  literature  concerning  survival  in 
sediment  toxicity  tests  has  been  created  during  the  last  two  decades.  Biomarkers  have 
significant  sources  of  uncertainty  as  sediment  toxicological  endpoints,  especially  with 
respect  to  ecological  relevance  and  interferences  by  nontreatment  factors.  The  continued 
development  of  more  sensitive  and  ecologically  relevant  endpoints  {e.g.,  chronic  effects 
on  growth  and  reproduction,  life-cycle  tables)  has  the  potential  to  produce  superior  mea¬ 
surement  endpoints  for  use  in  SERAs. 

Toxicity  tests,  in  and  of  themselves,  are  not  useful  for  identifying  contaminants  respon¬ 
sible  for  observed  responses.  Even  linkage  of  test  results  to  the  list  of  chemicals  generated 
during  an  exposure  assessment  might  prove  to  be  of  limited  use  for  defining  potential 
causes  of  toxicity  for  a  number  of  reasons:  1)  chemicals  responsible  for  toxicity  may  not 
have  been  measured,  2)  the  bioavailability  of  chemicals  in  either  pore  water  or  in  whole 
sediment  can  be  quite  uncertain,  and  3)  correlative  techniques  {Le.,  comparison  of  re¬ 
sponses  to  chemical  concentrations)  are  often  unable  to  deal  with  multiple  contributions 
from  complex  mixtures. 
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Toxicity  identification  evaluation  methods  provide  a  useful  approach  for  assessing  toxic¬ 
ity  contributions  in  sediment  phases  where  unmeasured  contaminants  may  be  respon¬ 
sible  for  toxicity  or  where  there  are  questions  as  to  appropriate  bioavailability  or  mixture 
toxicity  models  (Chapter  16).  Toxicity  identification  evaluation  methods  consist  of  toxic¬ 
ity-based  fractionation  schemes  which  are  capable  of  identifying  toxicity  due  either  to 
single  compounds  or  broad  classes  of  contaminants  with  similar  properties.  Overall,  TIE 
methods  offer  a  logical  solution  for  bridging  the  gap  between  effects  and  exposure  assess¬ 
ments,  and  would  result  in  a  more  accurate  and  comprehensive  characterization  of  risk. 
Sediment  TIEs  have  typically  been  conducted  using  pore  water  as  the  test  phase;  however, 
methods  are  being  developed  for  testing  whole  sediments  (Ankley  and  Schubauer- 
Berigan  1995). 

18.4  Evaluation  of  uncertainty  associated  with  benthic 
invertebrate  assessments  of  sediments 

Under  the  category  of  benthic  invertebrate  assessments,  an  array  of  measurement  tech¬ 
niques  are  included.  We  have  not  considered  the  use  of  the  tools  at  a  fine  level  of  detail 
(e.g.,  type  of  sampling  device,  statistical  method),  as  this  is  beyond  the  scope  of  this  re¬ 
view.  A  recent  volume  edited  by  Rosenberg  and  Resh  (1993)  describes  the  current  state  of 
benthic  biomonitoring.  Rather,  we  have  evaluated  uncertainty  at  different  scales  of  orga¬ 
nization  and  have  made  the  assumption  that  users  will  use  the  optimal  or  most  appropri¬ 
ate  techniques  available  and  be  familiar  with  the  current  state  of  knowledge.  In  those 
cases  where  new  techniques  are  in  development,  they  have  been  specifically  mentioned. 

Benthic  assessment  methods  were  classified  at  different  organizational  scales,  from  the 
individual  to  the  community  level  (Table  18-3).  The  types  of  endpoints  included  at  these 
different  organizational  scales  are  1)  individual  (e.g.,  morphological  changes,  biomark¬ 
ers),  2)  population  (e.g.,  indicator  or  keystone  species  abundance,  population  size  struc¬ 
ture  and  life  history  modifications),  3)  community  structure  (e.g.,  indices,  metrics, 
multivariate  approaches),  and  4)  community  function  (e.g.,  functional  groups,  energy 
transfer,  size  spectra).  Although  community  ftinction  was  considered,  there  is  little  infor¬ 
mation  on  the  use  and  application  of  community  function  in  sediment  assessment. 
Therefore,  the  degree  of  uncertainty  associated  with  their  use  is  high  because  of  lack  of 
knowledge  (Chapter  9). 

The  primary  purpose  of  benthic  invertebrate  measurement  endpoints  (e.g,  indicators  or 
measurement  tools)  is  to  identify  departure  of  the  endpoint  from  either  an  expected  or 
predicted  condition,  given  normal  variability  in  both  time  and  space  (Chapter  10).  Fur¬ 
thermore,  measurement  endpoints  should  relate  such  a  departure  to  a  directional  stres¬ 
sor.  There  are  an  array  of  uncertainties  associated  with  such  a  judgment  and  the  decision 
on  whether  the  measured  state  has  departed  from  the  expected  value  requires  criteria  or 
guidelines  for  making  the  judgment.  We  have  identified  three  major  sources  of  uncer¬ 
tainty  which  are  associated  with  1)  lack  of  knowledge  of  the  system  about  which  measure¬ 
ments  are  being  made  ( i.e.,  what  is  a  normal  or  expected  state  for  this  system),  2)  a 
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Table  18-3  Uncertainty  associated  with  benthic  community  assessments 


Evaluation  criteria 

Individual 

Population 

Structure 

Function 

Precision 

1 

1 

2 

3* 

Ecological  relevance 

3 

2 

1 

3* 

Causality:  contamination 

2* 

2* 

2* 

3* 

Causality:  source 

2* 

3* 

3* 

3* 

Sensitivity 

1* 

1 

2 

3* 

Interference 

2* 

3* 

3* 

3* 

Standardization 

3* 

1 

1 

3* 

Discrimination 

2 

1 

1 

3* 

Bioavailability 

2* 

NA 

NA 

3* 

Field  validation 

3* 

1 

1 

3* 

1  =  low  uncertainty  (good) 
3  =  high  (bad) 

*  =  lack  of  knowledge 
NA=  not  applicable 


systematic  error  in  the  method  being  used,  and  3)  the  sampling  scale  selected  and  inher¬ 
ent  to  the  variable  being  measured. 

The  rest  of  this  section  discusses  relative  degree  of  uncertainty  associated  with  the  vari¬ 
ous  measurement  endpoints  (Table  18-3)  and  attempts  to  distinguish  uncertainty  asso¬ 
ciated  with  lack  of  knowledge  (indicated  with  an  asterisk)  from  systematic  or  stochastic 
uncertainty  which  can  be  rectified  (methodologically)  or  quantified  (sampling  decisions 
and  design).  We  have  categorized  each  type  of  measurement  endpoint  into  various  po¬ 
tential  sources  of  uncertainty,  each  are  described  separately  and  recommendations  are 
made  on  how  to  reduce  uncertainty  in  each  category. 

18.4.1  Precision 

Sampling  precision  refers  to  the  ability  to  define  and  reduce  variability  associated  with 
the  measurement.  The  ability  to  quantify  and  reduce  this  source  of  uncertainty  is  meth¬ 
odological,  cost  dependent,  and  primarily  related  to  the  scale  of  sample  in  relation  to  the 
“population”  being  estimated.  In  reality,  the  precision  decreases  as  the  scale  of  organiza¬ 
tion  increases,  thus  measurement  of  communities  is  less  precise  than  measurement  of  an 
individual.  However,  the  uncertainty  of  measurements  at  the  community  level  can  be 
quantified  and  reduced  by  appropriate  design  and  effort  (Elliott  1977;  Green  1979).  It  is 
recommended  that  appropriate  studies  be  conducted  to  identify  cost-effective  benthic 
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endpoints  in  relation  to  study  objectives  and  available  resources  to  reduce  or  quantify  the 
uncertainty  associated  with  problems  of  precision. 

A  study  to  develop  predictive  models  of  invertebrate  community  structure  for  the  Fraser 
River  basin  in  British  Columbia  involved  the  sampling  of  250  reference  sites  throughout 
the  240, 000-km^  basin.  To  establish  preliminary  levels  of  precision,  calibration  studies 
with  10  replicate  samples  were  used  to  establish  expected  variation  based  on  replicates 
taken  from  separate  riffles.  This  suggested  that  five  replicate  samples  were  sufficient  to 
produce  coefficients  of  variation  of  20%  to  70  %  for  the  major  families  (Figure  18-1).  More 
importantly,  further  replication  did  not  improve  precision.  A  subsequent  study  at  22 
sites  using  multivariate  analysis  of  three  replicates  from  each  site  showed  that  within-site 
variation  (based  on  replicates)  was  much  less  than  between  site  replication.  This  is  illus¬ 
trated  by  21  of  22  site  replicates  being  clustered  adjacent  to  each  other  (Reynoldson  TB, 
unpublished  data).  As  a  consequence  of  this  study,  it  was  decided  that,  for  identification 
of  communities  at  a  regional  scale,  single  samples  from  a  riffle  were  sufficient.  This  re¬ 
sulted  in  a  five-fold  reduction  in  cost  for  sampling  and  processing  invertebrate  samples 
without  affecting  the  certainty  of  a  site  being  correctly  classified. 

18.4.2  Ecological  relevance 

In  this  context,  ecological  relevance  refers  to  the  relation  of  the  measured  endpoint  to  the 
benthic  ecosystem.  Accordingly,  direct  measures  of  the  populations  of  organisms  present 
have  a  higher  certainty  of  being  related  to  ecosystem  than  measurements  at  a  finer  orga¬ 
nizational  scale  (Schindler  1987).  The  concept  of  using  biological  indicators  of  pollution 
began  in  Germany  at  the  end  of  the  last  century  (Kolkwitz  and  Marsson  1908),  and  by 
the  early  1980s,  more  than  50  different  methods  had  been  identified  (De  Pauw  and 
Vanhooren  1983).  Three  separate  assessment  approaches  using  macroinvertebrates  have 
been  distinguished  (Metcalfe  1989;  Johnson  etal  1993),  based  on  either  taxonomic  or 
pollution  tolerance  data:  1)  saprobic,  2)  diversity,  and  3)  biotic  index  approaches. 

Saprobic  approaches  are  based  on  the  dependence  of  organisms  on  organic  food  sources, 
and  saprobic  values  have  been  published  for  more  than  2000  European  species  (Sladacek 
1973).  This  approach  also  incorporates  organisms  other  than  macroinvertebrates.  Diver¬ 
sity  indices  were  first  applied  to  pollution  assessment  by  Wilhm  and  Dorris  (1968)  and 
were  first  seen  as  a  method  for  incorporating  ecological  principles  into  assessments  and 
avoiding  the  value  judgments  inherent  in  score  systems.  However,  their  use  has  been  fre¬ 
quently  criticized.  Proponents  of  diversity  indices  argue  that  the  indices  measure  real 
ecological  properties  and  processes  and  are  based  on  ecological  theory,  e.g.,  diversity/ 
stability  hypothesis  (Goodman  1975)  and  competitive  interaction  (Hurlbert  1971).  The 
most  widely  used  index  is  the  Shannon-Wiener  index  (Shannon  1948).  These  indices 
reach  their  maximum  value  when  all  species  are  evenly  distributed,  and  high  diversity  is 
considered  desirable  condition.  However,  such  an  even  distribution  of  species  is  not  nor¬ 
mal  in  most  communities,  and  therefore  the  underpinnings  have  to  be  questioned.  The 
actual  meaning  of  diversity  and  how  it  should  be  measured  is  also  questionable  and  has 
been  the  subject  of  much  debate  (Washington  1984).  Furthermore,  different  types  of 
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Figure  18-1  Variation  based  on  the  number  of  replicates  taken  from  separate  riffles 


pollution  may  have  different  impacts  in  different  habitats,  and  major  changes  in  species 
composition  may  occur  with  no  change  in  diversity.  However,  in  a  survey  of  methods 
being  used  in  assessment  (Resh  and  Jackson  1993),  40%  of  studies  had  included  diversity 
as  part  of  a  biological  assessment. 

The  biotic  index  approach  uses  a  single  index  value  to  summarize  information  and  to 
assess  pollution  effects  on  aquatic  communities  and  has  appeal  in  conducting  SERAs. 
The  majority  of  biotic  indices  are  based  on  indicator  organisms  rather  than  community 
composition.  However,  diversity  indices,  which  are  also  considered  by  some  as  a  form  of 
biotic  index,  use  attributes  of  community  structure  such  as  numbers  of  taxa  and  the  na¬ 
ture  of  their  distribution.  The  indicator  organism  concept  assigns  scores  to  species  based 
on  their  sensitivity  or  tolerance  to  various  stressors,  usually  organic  pollution.  This  ap¬ 
proach  tends  to  be  specific  to  one  or  two  types  of  pollution  (usually  organic)  and  conse¬ 
quently  such  systems  are  not  universally  applicable.  For  all  indices,  the  calculations 
require  a  quantitative  or  qualitative  estimate  of  the  individuals  or  taxa  present,  counts  (or 
estimates  of  the  biomass)  of  specific  groups  of  organisms  {e.g.,  Ephemeroptera, 
Plecotpera,  Trichoptera,  Chironomidae),  and  lists  of  the  responses  of  different  taxa  to 
pollution  or  separation  of  invertebrate  groups  based  on  function  {e.g.,  feeding  strategy). 
Their  use  is  either  based  on  a  comparison  of  a  test  site  to  a  theoretical  or  empirically  de¬ 
rived  score  (often  based  on  low-order  stream  communities)  or  based  on  an  upstream- 
downstream  comparison. 

The  biotic  index  approach  has  been  defined  (Tolkamp  1985)  as  one  which  combines  di¬ 
versity  on  the  basis  of  certain  taxonomic  groups  with  the  pollution  indication  of  indi¬ 
vidual  species  (or  higher  taxa)  into  a  single  index  or  score.  As  such,  it  can  be  considered 
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as  a  combination  of  saprobic  (scores)  and  diversity  concepts.  An  array  of  such  biotic  in¬ 
dices  or  biotic  score  systems  have  been  developed;  however,  the  majority  evolved  from 
the  Trent  Biotic  Index  (Woodiwiss  1964).  From  the  proliferation  of  indices  from  1960  to 
1980,  five  current  lines  can  be  identified  as  being  used  in  benthic  assessment.  The  Belgian 
Biotic  Index  (Europe)  and  the  Indice  Biologique  (France)  are  both  true  biotic  index  ap¬ 
proaches.  The  metrics  approach,  widely  used  in  the  United  States,  has  developed  from 
several  lines  but  is  primarily  adapted  from  Karr  s  (1991)  Index  of  Biotic  Integrity  and,  as 
such,  is  also  a  biotic  index.  The  other  main  line  derives  from  the  RIVPACS  approach 
(Wright  et  al.  1984),  used  in  the  United  Kingdom.  The  RIVPACS  approach  was  a  major 
divergence  from  the  biotic  index  approach  as  it  incorporated  a  predictive  capability  and 
relied  on  the  assessment  of  community  structure  rather  than  the  use  of  indicator  organ¬ 
isms.  This  forms  the  basis  for  methods  being  used  in  the  National  River  Health 
Programme  in  Australia  (Commonwealth  Environmental  Protection  Agency  [CEPA] 
1994)  and  the  BEAST  models  being  developed  in  Canada  (Reynoldson  et  al.  1995)  for  the 
Great  Lakes  and  Fraser  River.  Community  structure  measurements  have  a  low  degree  of 
uncertainty  in  terms  of  their  ecological  relevance,  and  changes  at  the  community  level  are 
highly  relevant.  However,  these  approaches  have  seldom  been  applied  to  habitats  with 
soft-bottom  substrates  where  contaminated  sediments  are  of  concern.  Specific  morpho¬ 
logical  endpoints  may  have  relatively  high  uncertainty  with  regard  to  their  ecological 
relevance  (Johnson  et  al  1993).  Even  the  loss  of  an  entire  population  of  a  species  may  or 
may  not  have  ecological  relevance  at  the  ecosystem  scale  depending  on  the  functional 
attributes  of  the  particular  species. 

18.4.3  Causality 

Two  types  of  causality  have  been  identified:  1)  the  relation  of  measured  effects  to  specific 
contaminants  and  2)  the  linkage  with  specific  sources  {i.e.,  spatial  or  geographic  loca¬ 
tion).  Relating  effects  to  cause  in  field  assessments  is  inherently  empirical  and  correlative, 
requiring  associated  chemical  and  laboratory  effects-based  data.  Each  level  of  organiza¬ 
tion  listed  in  Table  18-3  can  provide  information  on  spatial  origin  of  effect  with  an  appro¬ 
priately  designed  sampling  grid  and  effects  mapping  methodologies.  Lack  of  knowledge 
at  present  on  the  specificity  of  response  at  the  individual  level  (Graney  and  Giesy  1987; 
Giesy  1988)  precludes  linkage  to  specific  contaminants,  perhaps  with  the  exception  of 
metallothionein  or  acetylcholinestrase  inhibition  measurements  (Table  18-3).  However, 
further  research  in  this  area  may  provide  useful  measures  that  could  reduce  the  uncer¬ 
tainty  of  cause-and-effect  linkages  (Farris  et  al  1989;  Day  and  Scott  1990). 

At  the  other  extreme,  community  measurements  have  more  potential  to  identify  effects 
(reduce  false  negatives)  but  provide  limited  information  on  specific  contaminants  or 
stressor  causing  the  response.  The  endpoints  that  include  evaluation  of  multiple  species 
are  more  useful  in  mapping  source  origin  compared  to  population  studies.  Furthermore, 
research  on  community  response  in  dose-response  experiments  might  potentially  lead  to 
the  use  of  community  response  in  diagnosing  specific  contaminants  (categories)  in  mul¬ 
tiple  source  situations  (Chapter  9).  It  is  recommended  that  further  research  is  needed  to 
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identify  contaminant-specific  biomarkers  and  species  sensitivity  to  contaminant  catego¬ 
ries. 

18.4.4  Sensitivify 

This  category  relates  to  the  likelihood  that  the  measured  endpoint  is  responding  to  a 
contaminant  and  is  analogous  to  an  analytical  detection  level  for  chemical  analysis.  There 
is  a  need  to  minimize  uncertainty  associated  with  both  false  negatives  (toxic  sample  in¬ 
correctly  classified  as  not  toxic)  and  false  positives  (nontoxic  sample  incorrectly  classified 
as  toxic).  Measurements  of  individuals  and  biomarkers  are  inherently  more  sensitive 
than  measurements  at  the  community  level  (Kingett  1985;  Giesy  1988).  In  the  context  of 
sediment  assessments,  the  relative  usefulness  of  a  method  can  be  determined  by  using 
criteria  adapted  from  Giesy  and  Hoke  (1989)  and  Calow  (1993): 

•  Relevance.  The  ecological  significance  of  the  measurement  at  scales  greater  than 
that  being  tested.  In  all  testing,  we  are  concerned  about  overall  biological 
structure  and  function.  We  cannot  test  all  system  attributes.  Therefore,  the 
measure  being  tested  is  simply  a  surrogate  for  the  system  as  a  whole.  The  ease 
with  which  we  can  relate  the  test  to  the  system  is  a  measure  of  relevance 
(Chapter  12).  In  many  cases,  this  cannot  be  done  quantitatively  but  is  a  subjec¬ 
tive  evaluation. 

•  Robustness.  In  order  to  provide  consistency  in  interpretation  and  decision¬ 
making,  it  is  important  that  the  tests  provide  consistent  results  irrespective  of 
when  and  where  they  are  conducted.  This  is  clearly  more  difficult  in  the  case  of 
field  tests  where  environmental  variation  can  be  large.  However,  standardized 
methods  can  reduce  a  significant  portion  of  the  variation,  and  some  measure¬ 
ment  variables  are  inherently  more  robust  in  either  time  or  space. 

•  Methodology.  Both  ease  of  use  and  detectability  of  the  measurement  endpoint 
(or  availability  of  test  organisms  in  laboratory  tests)  are  important  criteria.  If 
the  method  is  too  complex,  expensive,  or  uses  an  inappropriate  duration,  it  is  of 
limited  value  for  routine  use. 

•  Sensitivity.  The  variable  chosen  for  measurement  should  be  sensitive  to  toxi¬ 
cants  and  in  an  appropriate  range,  sufficient  to  detect  a  response  but  also  to 
discriminate  between  measurement  locations. 

•  Appropriateness/ Application.  The  method  being  selected  should  be  appropri¬ 
ate  to  the  system  for  which  the  study  is  being  designed;  many  useful  methods 
are  only  applicable  to  specific  situations  or  environments.  Some  methods  have 
much  broader  application  than  other  techniques. 

Available  methods  cover  the  entire  range  of  biological  organization  from  the  simplest  (e.g. 
the  cell)  to  most  complex  {e.g.,  ecosystem).  As  Calow  states  (1993)  "...moving  down  the 
hierarchy  [ecosystem  to  molecule],  systems  become  more  easy  to  control,  reaction  times 
reduce  and  criteria  for  robustness,  methodology  and  sensitivity  improve,  and  generality 
should  also  increase.  [However,]  since  ecological  principles  largely  involve  individuals 
and  groups  of  individuals  relevance  [and  application]  increase  in  the  opposite  direction.” 
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In  selecting  methods,  the  individual  has  to  balance  these  two  dilemmas,  and  inevitably 
compromise  decisions  have  to  be  made.  The  effect  has  occurred  by  the  time  a  change  is 
measured  at  the  community  level.  In  that  regard,  there  is  a  low  uncertainty  associated 
with  the  sensitivity  of  the  individual  response.  Concomitantly,  there  is  more  uncertainty 
of  the  absolute  sensitivity  of  the  response  at  the  level  of  the  individual  because  of  less 
knowledge  of  the  test;  however,  this  approach  is  inherently  more  sensitive  and  has  an 
early  warning  component  not  present  in  community  measurements. 

18.4.5  Interferences 

Interferences  as  a  source  of  uncertainty  refers  to  the  confounding  effects  of  biotic  and 
abiotic  factors  in  the  interpretation  of  responses  distinguishable  from  normal  variation 
and  contaminants  associated  with  sediment.  Until  recently,  the  use  of  biological  methods 
was  often  confounded  by  the  temporal  and  spatial  variability  inherent  in  biological  sys¬ 
tems.  However,  over  the  past  10  years,  methods  developed  in  the  United  Kingdom 
(Wright  et  al  1984;  Armitage  et  al  1987;  Moss  et  al  1987;  Ormerod  and  Edwards  1987) 
and  elsewhere  (Corkum  and  Currie  1987;  Johnson  and  Wiederholm  1989)  have  demon¬ 
strated  the  ability  to  predict  the  community  structure  of  benthic  invertebrates  in  clean 
(or  “uncontaminated”)  sites  using  simple  habitat  and  water  quality  descriptors.  This 
approach  has  allowed  the  development  of  site-specific  biological  objectives  to  be  set  for 
ecosystems  selected  by  habitat  characteristics  and  also  provides  an  appropriate  reference 
for  determining  when  degradation  at  a  site  is  due  to  anthropogenic  contamination.  The 
acceptance  by  regulatory  agencies  of  biological  water  and  sediment  quality  objectives  are 
being  considered  in  Canada  (Reynoldson  and  Zarull  1993;  Reynoldson  et  al  1995),  the 
United  States  (Barbour  et  al  1995),  and  the  United  Kingdom  (the  RIVPACS  method; 
Wright  et  al  1984),  and  recent  initiatives  in  Australia  (CEPA 1994).  Because  of  the  smaller 
scale  of  factors  accounting  for  responses  of  individuals,  the  uncertainty  associated  with 
interference  is  lower.  However,  few  studies  have  been  conducted  to  examine  the  role  of 
interference  on  these  approaches.  There  is  limited  information  available  on  normal  lev¬ 
els  for  many  biomarkers.  Population-  and  community-level  measurements  have  continu¬ 
ally  been  confounded  by  the  influence  of  abiotic  factors  and  an  inability  to  distinguish 
between  directional  {e.g.,  response  to  a  trend  or  gradient)  and  nondirectional  {e.g,  sea¬ 
sonal  or  annual)  variability. 

Approaches  using  reference  areas  have  allowed  the  development  of  predictive  models  of 
community  structure  that  minimize  nondirectional  variability  (Wright  et  al  1984; 
Johnson  and  Wiederholm  1989;  Reynoldson  et  al  1995).  This  is  the  primary  reason  for 
the  use  of  multivariate  rather  than  metric  community  structure  assessment  methods. 
Recent  developments  in  the  analysis  of  biological  data  using  multivariate  statistical  tech¬ 
niques  have  shown  promising  results  in  interpreting  changes  in  community  structure 
based  on  environmental  parameters.  As  a  first  step  toward  the  identification  of  the  best 
achievable  community  for  a  specific  type  of  habitat,  there  is  a  need  to  define  reference 
communities  based  on  chemical,  physical,  geological  and  geographical  features  in  areas 
free  from  contamination.  The  greatest  effort  in  this  direction  has  been  made  in  the  study 
of  riffle  habitats  in  streams  and  rivers  of  the  United  Kingdom  (Wright  et  al  1984; 
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Armitage  at.  1987;  Moss  et  al  1987).  Similar  studies  have  been  conducted  in  North 
America  (Corkum  and  Currie  1987),  continental  Europe  (Johnson  and  Wiederholm 
1989),  and  South  Africa  (King  1981).  An  extensive  collection  of  data  at  nonpolluted  sites 
in  the  United  Kingdom  has  resulted  in  the  identification  of  a  number  of  natural  commu¬ 
nities  (Wright  etal  1984).  Environmental  variables  such  as  latitude,  substrate  type,  tem¬ 
perature,  and  depth  were  used  to  correctly  predict  the  benthic  communities  at  75%  of  268 
sites,  and  at  more  than  90%  of  the  sites,  the  observed  community  was  similar  to  either  the 
predicted  or  the  next  most  similar  predicted  community.  At  lower  levels  of  community 
detail,  even  greater  accuracy  of  prediction  was  observed.  Other  studies  have  shown  a 
similar  predictive  capability  (see  Table  18-4),  and  the  accuracy  of  prediction  in  these  stud¬ 
ies  ranges  from  68%  to  90%  in  habitats  varying  from  large  lakes  to  small  streams  and  on 
three  continents.  The  expected  community  assemblages  at  a  location  can  be  defined  from 
a  predictive  model  using  a  limited  number  of  physical  variables.  Such  predicted  commu¬ 
nities  or  key  species  in  the  communities  can  be  used  to  establish  site-specific  guidelines, 
which  may  be  compared  with  the  actual  species  composition.  Thus,  determinations  on 
whether  or  not  the  guideline  is  being  met  can  be  made.  It  is  recommended  that  a  stan¬ 
dard  set  of  environmental  attributes  be  established  to  foster  future  development  of  mod¬ 
els  for  predicting  variability  of  community  structure  in  reference  areas  and  that  these 
approaches  be  further  applied  to  habitats  with  soft-bottom  sediments  where  contami¬ 
nated  sediments  are  of  concern. 

18.4.6  Standardization 

The  possible  degree  of  standardization  is  high  for  population  and  community  measure¬ 
ments;  therefore,  the  degree  of  uncertainty  resulting  from  standardization  can  be  mini¬ 
mized.  There  are  many  reviews  of  sampling  devices,  methods,  and  handling  procedures, 
and  consistent  methods  are  an  absolute  requirement  (Elliott  1977;  Plafkin  et  al  1989; 
Norris  and  Georges  1993).  Many  studies  have  illustrated  that  apparent  responses  result¬ 
ing  from  methodological  changes  {e,g.,  mesh  size,  grab)  can  outweigh  effects  resulting 
from  contaminants.  The  uncertainty  resulting  from  a  lack  of  standard  methods  with  use 
of  biomarkers  and  with  individual  tests  is  high  because  of  the  relative  newness  of  these 
methods.  It  is  recommended  that  studies  be  undertaken  to  establish  methods  appropri¬ 
ate  to  the  issue  of  concern  and  that  methods  not  be  changed  once  established.  Where 
possible,  common  methods  should  be  used.  If  a  methodological  change  is  essential,  then 
appropriate  calibration  is  necessary. 

18.4.7  Discrimination 

Discrimination  refers  to  the  ability  of  a  measure  to  identify  a  response  range.  This  is  re¬ 
lated  to  the  continuous  nature  of  the  measurement  endpoint  and  the  variables  associated 
with  that  measure.  The  large  number  of  species  involved  in  community  measures  reduces 
the  uncertainty  that  an  endpoint  will  not  be  unable  to  discriminate  between  contaminant 
levels.  While  it  could  be  argued  that  the  loss  of  one  or  two  species  may  be  missed  at  the 
community  level  of  analysis,  the  loss  of  single  species  is  not  necessarily  of  significance 
unless  they  affect  the  overall  community  structure,  at  which  point  such  effects  will  be 
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Table  18-4  Ability  to  predict  benthic  invertebrate  community  structure 


Habitat 

Number 

sites 

Number 

community 

groups 

Predictor 

variables 

Prediction 

success  rate  Reference 

Lakes 

68 

7 

11 

90.0 

Johnson  and  Wiederholm  1989 

Lakes 

95 

6 

9 

90.0 

Reynoldson  etal.  1995 

Rivers 

35 

5 

6 

75.4 

Reynoldson  and  Zarull  1993 

Rivers 

45 

6 

5 

68.9 

Ormerod  and  Edwards  1987 

Rivers 

54 

5 

9 

79.6 

King  1981 

Rivers 

286 

16 

28 

76.1 

Wright  etal  1984 

Streams 

79 

5 

26 

70.9 

Corkum  and  Currie  1987 

observed  by  comparison  of  changes  at  the  community  level.  New  approaches  to  commu¬ 
nity-level  analysis  can  identify  normal  variation  in  unimpacted  communities  and  detec¬ 
tion  of  change  at  a  pre-selected  probability  level  (Reynoldson  etal  1995).  Furthermore, 
assessment  methods  being  used  in  the  United  Kingdom  and  Australia  allow  the  predic¬ 
tion  of  the  number  of  taxa  at  different  taxonomic  levels  {e.g.,  family,  species)  and  there¬ 
fore  the  absence  of  single  species  can  be  established,  again  at  predetermined  probability 
levels  (CEPA 1994;  Wright  1995).  Conversely,  a  single  individual  or  biomarker  has  higher 
uncertainty  of  being  a  useful  discriminator.  Decisions  require  comparison  of  the  mea¬ 
surement  with  some  reference  or  control  value.  There  is  inevitably  a  level  of  uncertainty 
associated  with  such  decision-making.  Community-  and  population-level  criteria  are 
being  established  (Karr  1990;  Davis  and  Simon  1995;  Reynoldson  et  al  1995;  Yoder  and 
Rankin  1995).  However,  we  are  unsure  as  to  whether  analysis  has  been  conducted  on  the 
degree  to  which  error  has  been  established  in  the  use  of  such  criteria.  It  is  recommended 
that  an  analysis  of  the  error  associated  with  benthic  criteria  be  undertaken. 

18.4.8  Bioavailabilify 

Community-level  assessment  methods  {e.g,  diversity  indices,  biotic  indices)  can  be  af¬ 
fected  by  interference  from  noncontaminant  effects;  however,  newer  approaches  being 
used  to  help  address  this  issue.  The  use  of  reference  sites  to  establish  expected  normal 
variation  is  an  effective  method  of  addressing  this  issue,  and  approaches  to  establishing 
reference  conditions  are  addressed  (Hughes  1995;  Omernick  1995;  Reynoldson  et  al. 
1995;  Wright  1995).  Because  of  the  lack  of  knowledge  on  the  specificity  of  response  in 
biomarkers,  we  are  more  uncertain  of  the  meaning  of  a  negative  result  in  biomarker  tests. 

18.4.9  Field  validation 

The  responses  in  populations  or  communities  are  field  responses.  We  are  uncertain  as  to 
the  degree  of  validation  required  for  biomarkers;  however,  some  relation  to  other  field 
measurements  is  necessary.  Field  validation  is  understood  to  imply  the  likelihood  of  the 
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measure  being  used  is  an  appropriate  indicator  of  risk  associated  with  sediment  contami¬ 
nation.  The  uncertainty  associated  with  the  accuracy  of  benthic  community  assessments 
is  primarily  either  systematic  or  results  from  a  lack  of  knowledge  (/>.,  background  vari¬ 
ability).  Insufficient  information  exists  on  the  response  of  endpoints  at  the  level  of  the 
individual  or  the  population  to  establish  that  a  change  in  a  measured  endpoint  is  a  spe¬ 
cific  response  to  a  contaminant.  Insufficient  information  exists  to  establish  sensitivity  of 
species  or  individuals  to  sediment-associated  contaminants.  However,  community-level 
assessments  are  more  likely  to  include  sediment  effects  as  the  number  of  species  exam¬ 
ined  increases  the  probability  of  observing  a  sediment-associated  response.  It  is  recom¬ 
mended  that  the  relations  should  be  examined  between  responses  measured  using 
biomarkers  and  responses  measured  at  the  population  and  community  level. 

18.4.10  Summary 

The  use  of  benthic  invertebrates  in  the  assessment  and  management  of  contaminated 
sediments  has  been  recommended  as  part  of  the  sediment  quality  triad  approach,  which 
incorporates  aspects  of  sediment  chemistry,  toxicity  testing,  and  analysis  of  the  commu¬ 
nity  structure  of  the  benthos  (Long  and  Chapman  1985;  Canfield  et  al  1996).  The  IJC 
(1987, 1988)  has  suggested  a  sediment  management  strategy  that  incorporates  both  as¬ 
sessment  and  remediation.  The  United  States  has  embarked  on  a  program  to  examine 
various  approaches  to  the  assessment  and  remediation  of  contaminated  sediments 
(ARCS  program;  Fox  and  Tuchman  1996),  and  Canada  is  also  addressing  these  issues  via 
the  Great  Lakes  Action  Plan.  However,  there  are  still  major  objections  to  the  use  of  the 
structure  of  the  benthic  community  to  set  sediment  criteria  or  objectives.  The  main  criti¬ 
cisms  are  their  lack  of  universality  {i.e.,  they  are,  of  necessity,  site-specific)  and  the  inabil¬ 
ity  of  researchers  to  establish  quantitative  objectives  for  their  application  {i.e.,  what 
should  the  community  “look”  like?).  Universal  guidelines  may  not  be  possible  due  to  the 
very  nature  and  complexity  of  sediment-contaminant-biota  relationships  and  the  diver¬ 
sification  of  biological  and  geological  components  over  a  large  regional  area. 

Recent  developments  in  the  analysis  of  biological  data  using  multivariate  statistical  tech¬ 
niques  have  shown  promising  results  in  interpreting  changes  in  community  structure 
based  on  environmental  parameters.  As  a  first  step  toward  the  identification  of  the  best 
achievable  community  for  a  specific  type  of  habitat,  there  is  a  need  to  define  reference 
communities  based  on  chemical,  physical,  geological,  and  geographical  features  in  areas 
free  from  contamination.  Selecting  only  a  few  physical  variables,  the  expected  commu¬ 
nity  assemblages  at  a  location  has  been  defined  from  predictive  models.  Such  predicted 
communities  or  key  species  in  the  communities  can  be  used  to  establish  site-specific 
guidelines,  which  may  be  compared  with  the  actual  species  composition.  Thus,  determi¬ 
nations  could  be  made  on  whether  or  not  the  guideline  is  being  met.  While  this  reference 
community  approach  has  been  applied  to  riffle  habitats  at  a  variety  of  locations,  addi¬ 
tional  studies  are  needed  to  further  evaluate  this  approach  in  habitats  with  soft-bottom 
sediments  where  contaminated  sediments  are  of  concern.  To  date,  this  approach  has  only 
been  used  on  community  assemblages  with  species  as  the  classifying  variables.  There  is 
no  reason  why  the  same  approach  could  not  be  applicable  to  either  structural  or  func- 


SETAC  Press 


320 


ECOLOGICAL  RISK  ASSESSMENT  OF  CONTAMINATED  SEDIMENTS 

tional  variables  such  as  reproduction,  growth,  or  biomarker  tests  that  can  be  included  in 
a  multivariate  design  to  classify  both  predictable  communities  and  responses  in  tests. 

18.5  Evaluation  of  uncertainly  associated  with 
bioaccumulation  assessments  of  sediments 

The  principal  use  of  bioaccumulation  is  to  estimate  the  exposure  or  dose  which  organ¬ 
isms  encounter  in  a  sediment.  Bioaccumulation  is  not  an  appropriate  assessment  ap¬ 
proach  for  contaminants  that  are  metabolized  or  otherwise  not  accumulated  in  the 
tissues  of  the  organisms  being  evaluated.  Four  general  approaches  for  bioaccumulation 
are  used  in  risk  assessment;  1)  a  laboratory  approach  exposing  organisms  to  sediment 
under  controlled  conditions,  2)  a  field  approach  collecting  species  from  a  study  area,  3) 
assessment  of  food  web  transfer,  and  4)  models  to  predict  bioaccumulation  processes 
(Chapter  15).  Uncertainties  associated  with  the  approaches  are  summarized  in  Table 
18-5. 


Laboratory  approach 

Individuals  of  a  single  species  are  exposed  under  controlled  laboratory  conditions  to 
sediments  collected  from  the  study  area  being  assessed  (USEPA  1994a;  ASTM  1995e). 
After  an  established  period  of  exposure,  the  tissues  of  the  organisms  are  analyzed  for  the 
contaminant  of  concern.  Bioaccumulation  has  occurred  if  the  final  concentration  in  tis¬ 
sues  exceeds  concentrations  that  were  present  before  the  exposure  was  started.  This  re¬ 
quires,  of  course,  that  individuals  representative  of  initial  conditions  are  also  analyzed. 

Field  approach 

Tissue  concentrations  of  contaminants  are  determined  by  collecting  one  or  more  species 
exposed  to  sediments  at  the  study  area  being  assessed.  These  concentrations  are  com¬ 
pared  to  some  reference  areas  for  interpretation.  Two  methods  have  been  used  to  deter¬ 
mine  bioaccumulation  in  the  field:  1)  organisms  resident  at  the  area  are  collected  in  situ 
for  analysis  or  2)  organisms  are  transplanted  from  another  location  (presumably  with  a 
history  of  little  contaminant  exposure)  to  the  area  of  concern,  then  re-collected,  and  tis¬ 
sues  are  analyzed  after  an  established  period  of  exposure.  The  first  approach  takes  advan¬ 
tage  of  organisms  which  have  integrated  exposure  to  the  sediments  in  the  area  over  their 
lifetimes.  However,  if  sediments  in  the  area  are  highly  contaminated  or  if  environmental 
conditions  in  the  area  are  otherwise  unsuitable,  no  appropriate  organisms  may  be  avail¬ 
able  (Cain  et  al.  1992).  For  example,  hydropsychid  caddisflies  have  been  successfully  used 
to  assess  changes  in  metal  exposures  over  400  km  of  a  river  receiving  input  of  mining 
wastes  (the  Clark  Fork  River  in  Montana).  Although  relatively  tolerant  to  metal  contami¬ 
nation,  these  organisms  do  not  survive  in  reaches  of  the  river  adjacent  to  a  mining-smelt¬ 
ing  complex.  Thus,  downstream  assessments  of  exposure  in  less  contaminated  areas 
were  possible,  but  no  data  could  be  obtained  in  the  reach  of  the  stream  most  affected  by 
the  contamination  (Cain  etal  1992). 

Comparisons  among  several  species  are  employed  in  some  bioaccumulation  studies  (al¬ 
though  different  species  should  never  be  combined  in  a  single  sample,  unless  a  lack  of 
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Table  18-5  Uncertainty  associated  with  bioaccumulation  assessments 


Evaluation  criteria 

Food  web 

Models 

Laboratory 

Field 

Precision 

3 

3 

1 

2 

Ecological  relevance:  protection  of  ecology 

3 

3 

3 

3 

Ecological  relevance;  protection  of  human 
health 

1 

1 

1 

1 

Causality:  source  identification 

3 

1 

1 

3 

Causality:  sensitivity  (detection  limit) 

3* 

3* 

1 

2 

Interferences 

NA 

NA 

2 

2* 

Standardization 

2 

2 

1 

1 

Discrimination 

1 

1 

1 

1 

Bioavailability 

1 

1 

1 

1 

Field  validation 

2* 

2* 

2* 

1 

1  =  low  uncertainty  (good) 

3  =  high  (bad) 

*  =  lack  of  knowledge 
NA  =  not  applicable 


difference  is  proven).  NOAA  s  Mussel  Watch  employs  mussels  in  marine  ecosystems,  oys¬ 
ters  in  brackish  waters,  and  freshwater  clams  in  the  least  saline  or  freshwater  habitats. 
Data  from  different  species  have  been  useful  in  some  descriptions  of  exposures 
(Campbell  et  al  1988;  Cain  et  al  1992),  but  comparing  tissue  residues  among  species  can 
be  complex  if  the  objectives  of  the  exposure  assessment  are  quantitative. 

Transplantation  of  organisms  assures  that  organisms  will  be  available  for  analysis  (unless 
the  exposure  results  in  mortality  or  avoidance).  However,  the  exposure  is  typically  limited 
in  duration  (less  than  a  generation  time),  and  uncertainties  about  the  kinetics  of  bioac¬ 
cumulation  may  affect  the  accuracy  of  the  results.  The  behavior  of  the  transplanted  or¬ 
ganism  or  whether  the  organism  was  exposed  to  contaminants  via  the  routes  which  are 
important  for  resident  species  is  a  source  of  uncertainty  in  this  approach.  For  example, 
mussels  {Mytilus  edulis  or  Mytilus  californianus)  are  often  exposed  in  “bagged  bivalve” 
deployments,  sometimes  where  mussels  do  not  live  naturally.  Inadequate  availability  of 
food  or  other  stresses  in  the  deployment  area  could,  at  least  theoretically,  result  in  differ¬ 
ent  bioaccumulation  than  occurs  in  resident  species.  Careful  comparisons  of  bioaccumu¬ 
lation  in  resident  and  transplanted  species  are  rare  (Cain  and  Luoma  1985). 

Models 

Models  which  describe  bioaccumulation  are  relatively  well  developed  for  both  organic 
and  inorganic  contaminants  (Thomann  1989;  Chapter  15).  Toxicokinetic  models  have  a 
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long  history,  as  do  simpler  models  of  bioaccumulation  processes.  Site-specific  models 
predict  bioaccumulation  on  the  basis  of  laboratory-determined  characterization  of  bio¬ 
logical  processes  in  the  species  of  interest  and  field-determined  chemical  measurements 
at  the  area  of  concern.  Predictive  modeling  is  relatively  promising  in  this  field  because 
bio  accumulation  is,  arguably,  less  complex  than  some  of  the  other  endpoints.  Some  un¬ 
certainties  remain  unresolved  in  most  models  and  consensus  does  not  exist  about  the 
appropriate  model  to  apply  for  some  (if  not  all)  contaminants  (Chapter  15). 

Models  of  two  types  are  considered  in  bioaccumulation.  Equilibrium  models  are  com¬ 
mon  in  risk  assessment  and  in  sediment  criteria.  Geochemically  robust  models  of  equi¬ 
librium  partitioning  are  available  for  application  for  both  organic  and  inorganic 
contaminants.  These  can  be  relatively  simple  to  apply.  The  models  assume  contaminant 
concentrations  among  all  compartments  of  the  environment  are  controlled  by  thermo¬ 
dynamics  and  at  least  approach  equilibrium  conditions.  If  thermodynamic  equilibrium 
exists,  and  if  one  route  of  uptake  is  known  or  can  be  predicted,  overall  bioaccumulation 
is  inferred.  Recent  applications  use  an  extension  of  the  equilibrium  models,  termed  j(:/- 
netic  or  pathway  models  (ASTM  1995e).  These  models  incorporate  recognized  geochemi¬ 
cal  principles  but  recognize  the  uncertainties  in  the  assumptions  of  equilibrium.  Kinetic 
models  assume  that  routes  of  bioaccumulation  are  additive  and  must  be  determined  in¬ 
dependently.  Kinetic  models  and  equilibrium  models  may  yield  similar  results  if  con¬ 
taminant  distributions  and  concentrations  in  an  environment  are  at  equilibrium 
(although  not  always)  but  can  yield  very  different  results  where  environmental  compart¬ 
ments  are  not  at  equilibrium  (e.g.,  if  biological  processes  control  concentrations,  specia- 
tion,  or  phase  partitioning  of  contaminants).  Determining  whether  equilibrium 
conditions  exist  in  the  area  is  difficult. 

18.5.1  Precision 

Precision  is  the  stochastic,  and  perhaps  systematic,  uncertainty  that  occurs  in  repeated 
determinations  of  bioaccumulated  tissue  concentrations  {stochastic  is  used  in  a  purely 
statistical  sense,  in  that  much  of  the  uncertainty  has  its  origins  in  biological  processes 
which  are  at  least  partly  understood).  Variability  is  a  common  problem  in  bioaccumula¬ 
tion  studies  and  can  lead  to  imprecise  estimates  of  exposure  (Lobel  1987),  although 
methods  for  determining  bioaccumulation  have  identified  procedures  for  avoiding  ex¬ 
treme  sources  of  uncertainty  (Phillips  1980;  Campbell  et  al  1988;  Cain  et  al,  1992; 
Crawford  and  Luoma  1992;  Phillips  and  Rainbow  1993;  USEPA  1994a;  USEPA  and 
USACE 1994;  ASTM  1995e). 

Laboratory  bioaccumulation  tests  are  potentially  the  most  precise  of  bioaccumulation 
approaches  (Table  18-5).  However,  their  precision  is  directly  dependent  upon  biological 
factors  such  as  the  selection  of  appropriate  test  organisms;  use  of  healthy  and  acclimated 
organisms;  avoidance  of  anaerobic  conditions  and  other  sources  of  ancillary  stress  in  the 
test;  and  use  of  organisms  of  similar  size,  age,  and  life-history  stage.  Many  organisms  are 
capable  of  avoiding  potentially  toxic  concentrations  of  contaminant  sediments.  There¬ 
fore,  exposure  to  extremely  contaminated  sediments  can  result  in  a  highly  variable  bio¬ 
accumulation  response  by  some  test  organisms  (ASTM  1995e).  Even  under  the  best  of 
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conditions,  individuals  can  respond  very  differently  to  a  contaminated  sediment.  Thus, 
the  number  of  individuals  exposed  in  the  test  (n>12  is  ideal;  n<5  may  be  unacceptable) 
is  a  crucial  consideration,  as  is  the  number  of  replicates  analyzed  if  individuals  are  com¬ 
posited  for  analysis.  If  organisms  accumulate  sufficient  contaminants  to  elicit  toxicity,  the 
kinetics  of  bioaccumulation  may  be  altered  and  organisms  exposed  to  lower  doses  in 
some  sediments  may  accumulate  greater  concentrations  (Landrum  et  al  1994). 

Evaluation  of  organisms  exposed  in  the  field  and  food  web  analyses  require  the  same 
careful  attention  to  methods  as  described  above  for  laboratory  studies.  The  inherent  vari¬ 
ability  of  nature  is  added  to  the  potential  for  imprecision  which  can  be  associated  with 
laboratory  studies.  Number  of  individuals  sampled,  number  of  composites,  life  stage, 
size  of  organisms,  biases  from  analysis  of  gut  content,  or  surface  contamination  are  ex¬ 
amples  of  uncertainty  associated  with  field  approaches.  Cain  etal.  (1992)  and  Crawford 
and  Luoma  (1992)  have  addressed  some  of  these  sources  of  variability  in  the  use  of  field- 
collected  organisms  (z.e.,  estuarine  bivalves,  riverine  insect  larvae).  Large  sample  sizes  as 
well  as  characterization  of  potential  biases  such  as  species-specific  and  element-specific 
responsiveness  to  contaminants,  organism  size,  seasonal  growth,  temporal  variability, 
and  spatial  variability  on  different  scales  can  improve  precision  and  interpretability  of 
bioaccumulation  data.  Rigorous  methods  resulted  in  the  statistical  power  to  separate  15% 
to  25%  differences  in  mean  tissue  concentrations  with  95%  confidence  (Brown  and 
Luoma  1995).  When  resident  species  are  sampled,  an  additional  source  of  uncertainty  is 
the  ability  to  obtain  sufficient  numbers  of  individuals  to  reduce  imprecision  or  replicate 
composite  analyses.  This  can  often  be  overcome  by  careful  selection  of  target  species.  The 
field  approaches  are  not  discriminatory  alone  to  determine  if  sediments  are  the  source  of 
the  exposure.  Models  and  laboratory  exposures  are  needed  to  better  identify  routes  of 
exposure. 

Bioaccumulation  models  have  been  ranked  as  imprecise  because  of  the  large  knowledge 
gaps  which  remain  in  identifying  values  for  model  parameters  (Table  18-5).  Some  key 
parameter  values  differ  by  orders  of  magnitude  among  studies,  and  predictions  differ 
accordingly.  Rapid  advances  are  overcoming  this  difficulty  and  generalized  predictions 
are  developing,  but  until  scientific  consensus  about  model  approaches  is  reached,  model 
imprecision  in  site-specific  predictions  must  be  considered  a  significant  problem. 

18.5.2  Ecological  relevance 

Ecological  relevance  includes  both  relevance  to  ecological  change  and  relevance  to  human 
exposure  pathways.  A  limitation  to  the  bioaccumulation  approach  is  its  weak  link  to  eco¬ 
logical  change.  Bioaccumulation  does  not  mean  an  adverse  effect  is  occurring.  Organisms 
are  capable  of  detoxifying,  adapting  to,  or  otherwise  surviving  some  dose  of  all  contami¬ 
nants.  Correlations  between  bioaccumulated  contaminants  and  effects  of  contaminants 
are  also  not  as  well  established.  Widdows  and  Donkin  (1989)  showed  a  strong  link  be¬ 
tween  tissue  residues  of  organic  contaminants  and  physiological  measurements  of  health 
in  mussels.  Similar  linkages  are  not  established  for  metals.  Bioaccumulation  of  specific 
biochemical  fractions  of  metals  needs  to  be  better  understood  before  such  linkages  will 
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be  possible.  In  addition,  the  biochemical  status  of  organisms  that  may  affect  toxicity 
needs  to  be  better  understood. 

The  relevance  of  bioaccumulation  stems  mainly  from  its  value  in  characterizing  expo¬ 
sures.  Understanding  dose  is  essential  to  understanding  effects,  whether  dose  is  from  the 
chemical  itself  or  from  its  metabolites.  The  concentration  of  contaminants  assimilated 
into  tissues  may  be  the  most  sensitive  measure  of  the  dose  of  contaminant  an  organism 
experiences,  given  the  uncertainties  in  defining  bioavailability  from  chemical  measures 
alone.  Exposures  are  also  complex  in  nature  (Luoma  etal  1985, 1990);  the  temporal  and 
spatial  complexities  of  exposures  are  an  important  reason  why  effects  of  contaminants 
are  difficult  to  demonstrate  with  simplistic  studies  (Luoma  and  Carter  1991). 

Collection  of  organisms  exposed  in  the  field,  food  web  bioaccumulation  estimates,  and 
both  empirical  and  site-specific  models  provide  direct  determination  of  contaminant 
concentrations  in  aquatic  resource  (food)  species  and  provide  information  for  pathways 
of  human  exposure  (Table  18-5).  Where  tissue  concentrations  are  directly  determined  in 
the  food  organism,  there  is  little  uncertainty  about  relevance.  The  precise  human  path¬ 
way  exposure  is  predicted  with  less  certainty  if  analyses  of  a  surrogate  species  are  used  to 
estimate  human  exposures  from  a  variety  of  species  in  an  environment.  Nevertheless, 
carefully  designed  studies  can  reduce  uncertainty.  Laboratory  studies  are  not  highly  rel¬ 
evant  predictors  for  human  exposure  because  they  are  often  poor  predictors  of  steady- 
state  field  bioaccumulation.  The  exception  is  when  laboratory  studies  are  designed  to 
provide  parameter  values  for  models. 

18.5.3  Causality 

Causality  includes  linkage  to  sources  and  pathways  or  linkage  to  the  contaminant  caus¬ 
ing  the  effect.  Field  determinations  reflect  a  combination  of  exposure  pathways.  Bioaccu¬ 
mulation  data  alone  cannot  provide  information  about  whether  the  source  of  exposure 
was  water,  sediment,  or  pore  water.  Uncertainty  about  the  route  of  exposure  for  field  bio¬ 
accumulation  leads  to  uncertainty  in  the  problem  formulation  step  of  the  risk  analysis 
(Chapter  1).  Without  knowledge  of  the  most  important  route  of  exposure,  it  is  unclear 
whether  chemical  measurements  should  include  water,  sediment,  pore  water,  or  some 
fraction  of  each.  Models  which  combine  biological  parameters  and  field  data  on  water 
and  sediment  concentrations  can  help  resolve  this  problem  (Thomann  1989;  Luoma  etal 
1990).  Bioaccumulation  is  the  strongest  of  the  endpoints  in  drawing  linkages  to  the  con¬ 
taminant  of  concern,  because  it  involves  direct  determinations  of  those  contaminants. 
However,  bioaccumulation  alone  cannot  be  used  to  evaluate  effects  of  contaminants  on 
aquatic  organisms. 

18.5.4  Sensitivity 

Bioaccumulation  is  a  readily  detectable  response  because  it  measures  exposure  of  an  or¬ 
ganism  to  relevant  contaminants.  False  positive  responses  are  not  expected  with  bioaccu¬ 
mulation  estimates.  False  negative  determinations  of  whether  bioaccumulation  is 
occurring  should  be  less  prevalent  compared  to  the  other  measurement  endpoints  dis¬ 
cussed  in  previous  sections,  but  systematic  error  in  identifying  exposure  (false  negatives) 
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can  result  from  selection  of  inappropriate  contaminants,  test  organisms,  or  sampling  ap¬ 
proaches.  For  example,  bioaccumulation  is  not  an  appropriate  measure  of  chemicals 
which  are  metabolized  {e.g.,  PAHs  in  fish;  ASTM  1995e).  Bioaccumulation  is  not  appro¬ 
priate  for  determining  exposures  to  ammonia  or  some  metals,  which  are  not  bioaccumu¬ 
lated  before  exerting  toxic  effects.  Bioaccumulation  of  Zn  by  some  organisms  may  be 
physiologically  regulated  (ASTM  1995e).  Some  species  of  trout  infireshwater  streams  and 
some  species  of  bivalves  in  estuarine  environments  show  little  or  no  change  in  tissue  Zn 
concentrations  in  the  presence  of  contamination,  whereas  other  species  from  the  same 
groups  do  respond  (Moore  et  al  1990).  Mussels  and  some  other  filter-feeding  bivalves  are 
responsive  to  inputs  of  most  metals  but  respond  weakly  to  copper  contamination  (Brown 
and  Luoma  1995).  Laboratory  studies  and  transplant  field  studies  allow  control  of  the 
species  and  contaminant,  so  the  above  uncertainties  can  be  controlled.  Few  choices  are 
sometimes  available  in  selection  of  species  to  be  collected  from  the  field;  this  could  lead 
to  uncertainty  about  sensitivities.  Models’  results  will  be  fraught  with  uncertainty  about 
sensitivity  until  widely  accepted  parameter  values  are  established. 

18.5.5  Interferences 

Interferences  can  add  uncertainties  to  bioaccumulation  studies.  Sediment  characteristics 
are  an  important  source  of  uncertainty  in  laboratory  bioaccumulation  studies  (Table 
18-5)  because  collection,  transport,  and  deployment  can  change  sediment  characteristics 
from  conditions  in  situ.  Natural  factors,  mostly  biological,  can  add  stochastic  uncertainty 
to  field  determinations  of  bioaccumulation,  a  type  of  interference.  Variability  in  the  size 
of  organisms  within  a  sample  can  increase  error  by  two-fold  or  more,  if  size  is  not  ac¬ 
counted  for.  Brown  and  Luoma  (1995)  suggested  approaches  to  reducing  interferences 
which  can  affect  interpretation  of  field  bioaccumulation  data. 

Variability  over  small  spatial  scales  can  add  to  the  uncertainty  of  discriminating  between 
areas  on  larger  scales.  For  example,  temporal  variability  can  also  be  a  great  source  of 
uncertainty.  In  some  environments,  seasonal  variability  in  metal  concentrations  or  or¬ 
ganic  contaminant  concentrations  can  reach  four-  or  five-fold  (Luoma  et  al  1985).  In 
other  environments,  such  as  many  coastal  marine  systems,  seasonal  variability  can  be 
low.  Year-to-year  variability  in  metal  concentrations  in  insect  larvae  collected  at  low  flow 
from  the  Clark  Fork  River  (in  August)  is  more  than  two-fold  between  some  adjacent  years 
and  more  than  four-fold  between  other  years. 

18.5.6  Standardization 

Standardization  of  approaches  is  essential  in  use  of  the  bioaccumulation  estimations  (see 
the  studies  described  in  the  section  discussing  precision).  Use  of  different  methodologies 
is  an  important  source  of  uncertainty  when  attempting  to  compare  different  datasets. 
Consistent  approaches  for  field  sampling  and  laboratory  bioaccumulation  can  reduce 
uncertainty  (Phillips  1980;  Campbell  etal  1988;  Crawford  and  Luoma  1992;  Phillips  and 
Rainbow  1993;  USEPA  1994a;  USEPA  and  USACE 1994;  ASTM  1995e;  Brown  and  Luoma 
1995).  Models  are  the  least  standardized  of  the  approaches  (Table  18-5;  Chapter  15). 
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18.5.7  Discrimination 

In  the  previous  applications,  this  criterion  was  intended  to  evaluate  the  ability  of  a  mea¬ 
surement  endpoint  to  discriminate  between  contaminated  sediments  by  providing  a 
graded  response.  Bioaccumulation  is  inherently  a  continuous  response  (with  the  excep¬ 
tion  of  some  of  the  circumstances  described  under  sensitivity).  The  ability  of  bioaccumu¬ 
lation  to  discriminate  contamination  gradients  with  low  uncertainty  is  one  of  its 
advantages.  Inherently,  bioaccumulation  is  a  highly  quantitative  approach  for  discrimi¬ 
nating  the  risk  of  exposure  from  a  sediment. 

18.5.8  BioavaMability 

Bioaccumulation  directly  measures  bioavailability  in  both  laboratory  and  field  studies. 
Some  qualitative  uncertainty  in  bioavailability  (if  it  is  defined  as  contaminants  assimi¬ 
lated  into  tissues)  can  occur  in  determination  of  whole-tissue  concentrations.  This  is 
important  because  whole-tissue  concentrations  are  typically  analyzed  in  bioaccumula¬ 
tion  studies.  Undigested  gut  content  can  be  analyzed  as  part  of  the  tissue  burden  and 
cause  systematic  uncertainties  (upward  bias)  in  estimates  of  bioavailability  if  contami¬ 
nant  concentrations  in  food  are  high  compared  to  tissues  (and  if  food  mass  in  the  gut  is 
sufficiently  great).  Some  organisms  accumulate  contaminants  on  their  surfaces  (insect 
larvae  in  mine-contaminated  streams)  or  cell  walls  (microalgae).  Contaminants  in  gut 
content  and  on  animal  surfaces  will  be  consumed  by  predators,  so  there  is  no  consensus 
on  the  need  to  purge  all  undigested  contamination  from  the  gut  of  organisms.  Even  con¬ 
sidering  the  above  caveats,  bioaccumulation  is  the  least  ambiguous  approach  for  assess¬ 
ing  contaminant  bioavailability. 

18.5.9  Field  validation 

Field  approaches  with  resident  species  directly  determine  bioaccumulation  (field  valida¬ 
tion  is  inherent).  Laboratory  bioaccumulation  can  be  validated.  When  enriched  tissue 
concentrations  are  observed  in  the  laboratory,  they  are  expected  in  the  environment. 
However,  there  is  uncertainty  in  quantitatively  attempting  to  relate  laboratory  bioaccu¬ 
mulation  results  to  the  field.  Laboratory  bioaccumulation  over  days  or  weeks  reflects  a 
mixture  of  influx  and  elimination  kinetics  and  may  not  reflect  lifetime  exposures  of  or¬ 
ganisms  in  the  field.  Some  studies,  especially  with  small  organisms,  have  successfully 
related  bioaccumulation  obtained  in  the  laboratory  with  field-collected  sediments  to  resi¬ 
due  concentrations  observed  in  synoptic  collection  of  organisms  from  the  field  (Ankley 
et <il  1992;  Brunson  et  al  1996).  Combined  laboratory  and  model  studies  show  greatest 
potential  for  reducing  uncertainties  in  extrapolating  from  controlled  studies  to  nature. 

1 8.5.1 0  Summary 

The  principal  use  of  bioaccumulation  is  to  estimate  the  exposure  or  dose  which  organ¬ 
isms  encounter  in  a  sediment.  Bioaccumulation  is  not  an  appropriate  assessment  ap¬ 
proach  for  contaminants  which  are  metabolized  or,  for  other  reasons,  not  accumulated 
in  the  tissues  of  the  organisms  being  evaluated.  Another  limitation  to  the  bioaccumula¬ 
tion  endpoint  is  its  weak  link  to  ecological  change.  Bioaccumulation  does  not  mean  an 
adverse  effect  is  occurring.  The  relevance  of  bioaccumulation  stems  mainly  from  its  value 
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in  characterizing  exposures  and  understanding  the  dose  an  organism  experiences.  This 
can  be  especially  valuable  information  if  used  to  expand  understanding  of  bioavailability 
or  if  exposures  are  complex  in  space  or  time  (as  is  often  the  case)  at  the  site  of  interest. 
Bioaccumulation  can  be  a  highly  variable  endpoint,  but  if  established  methods  are  fol¬ 
lowed  and  sample  size  is  adequate,  variability,  imprecision,  and  insensitivity  can  be  con¬ 
trolled. 

18.6  Evaluation  of  uncertainty  associated  with  chemistry 
measurements  of  sediments 

Sediment  chemistry  data  represent  a  fundamental  element  of  risk  assessments  focused 
on  evaluation  of  the  effects  of  toxic  chemicals  in  sediments.  Chemical  concentrations  are 
often  reported  on  a  dry  weight  basis,  based  on  a  total  extraction  of  sediment  samples. 
However,  measurements  of  total  chemical  concentrations  can  be  “normalized”  to  certain 
physical-chemical  characteristics  of  sediment  to  aid  in  interpretation  of  analytical  chem¬ 
istry.  For  example,  the  concentrations  of  nonionic  organic  contaminants  have  been  nor¬ 
malized  to  TOC  concentrations  in  sediment  (Swartz  et  al  1990;  Di  Toro  et  al  1991).  In 
addition,  AVS-normalization  procedures  may  be  used  to  interpret  metals  data  (Di  Toro 
etal  1990, 1992;  Ankley,  Phipps,  etal  1991).  Alternatively,  metal  concentrations  may  be 
normalized  to  those  of  a  reference  element  (such  as  Al  or  Li;  Schropp  et  al.  1990;  Coring 
1991)  or  to  grain  size  (Chapman  1992).  In  some  cases,  the  concentrations  of  contami¬ 
nants  in  porewater  and  elutriate  samples  are  also  determined.  Some  of  these  normaliza¬ 
tion  procedures  are  intended  to  better  define  the  bioavailable  fraction  of  the  substance 
under  consideration,  whereas  others  provide  a  means  of  comparing  contaminant  concen¬ 
trations  to  background  levels. 

The  uncertainty  associated  with  the  following  measures  of  sediment  chemistry  was  evalu¬ 
ated:  1)  whole  sediment  analysis  using  total  extraction  of  sediments  (Forstner  1990);  2) 
normalization  of  nonionic  organic  contaminants  to  total  organic  carbon  concentration 
of  sediment  (Di  Toro  et  al  1991);  3)  metal  speciation  as  derived  by  AVS  or  by  evaluating 
other  partitioning  phases  (Di  Toro  et  al  1992);  4)  concentration  of  contaminants  in  pore- 
water  samples  (Adams  1991);  5)  concentrations  of  contaminants  in  elutriate  samples 
(USEPA-USACE 1991, 1994);  and  6)  concentrations  of  reference  elements  {e.g.,  Al  and  Li; 
Schropp  et  al  1990;  Loring  1991)  which  are  regional  reference  levels  to  which  contami¬ 
nant  concentrations  are  compared.  These  methods  are  also  used  to  eliminate  particle  size 
and  other  natural  interferences  (these  include  Al  or  Li  normalization  or  sieving  to  remove 
particles  >  64  pm  diameter;  Chapman  1992). 

These  categories  consider  a  variety  of  sediment  phases  commonly  used  to  assess  contami¬ 
nation  (Table  18-6).  In  particular,  measurements  of  sediment  chemistry  have  been  used 
as  an  initial  screen  to  identify  sediments  and  substances  of  potential  concern.  The  avail¬ 
able  information  does  not  support  a  quantitative  evaluation  of  the  uncertainty  associated 
with  each  of  the  sediment  chemistry  measures  considered  in  Table  18-6.  For  this  reason, 
an  evaluation  of  the  relative  uncertainty  of  each  approach  was  conducted  to  provide  guid¬ 
ance  on  the  selection  of  screening  tools.  It  should  be  noted  that  this  assessment  addresses 
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the  uncertainty  associated  with  the  use  of  sediment  chemistry  alone  in  sediment  assess¬ 
ments.  A  lower  level  of  uncertainty  would  be  assigned  to  several  of  the  chemistry  mea¬ 
sures  if  they  were  used  in  combination  with  other  measurement  endpoints  (e.g.,  toxicity 
tests,  benthic  community  assessments,  SQGs).  The  workgroup  also  attempted  to  account 
for  uncertainty  associated  with  each  analytical  chemistry  measurement  (i.e.,  whole  sedi¬ 
ment  PCBs)  and  uncertainty  associated  with  the  use  of  the  chemistry  measurement  alone 
to  interpret  sediment  contamination  (ie.,  OC  normalization  of  PCBs). 

18.6.1  Precision 

Precision  was  defined  in  terms  of  the  robustness  of  the  analytical  method.  That  is,  proce¬ 
dures  that  generate  similar  concentrations  in  repeated  analyses  of  the  same  samples  were 
considered  to  have  a  lower  level  of  uncertainty  than  those  that  generate  variable  results. 
The  lowest  level  of  uncertainty  was  assigned  to  whole  sediment,  TOC-normalization, 
SEM/AVS,  elutriate,  and  reference  element  measurements  because  a  high  level  of  preci¬ 
sion  can  be  attained  using  existing  analytical  methods,  Porewater  chemistry  and  proce¬ 
dures  intended  to  determine  the  species  of  contaminant  present  in  the  sample 
(speciation  procedures)  were  assigned  a  higher  level  of  uncertainty  resulting  from  the 
lack  of  routine  methods  used  in  these  analyses.  Rigorous  determinations  of  metals  in 
pore  waters  require  working  with  small  sample  volumes  (which  creates  challenges  in  pre¬ 
concentrating  to  measurable  concentrations).  All  determinations  of  metals  require 
sample  preparation  and  analyses  be  conducted  in  ultra-clean  conditions  (e.g.,  concentra¬ 
tions  in  the  pore  water  or  elutriates  can  be  in  the  low-to-sub  pg/l  range,  which  creates 
possibilities  for  contamination).  If  sediments  are  anoxic,  these  procedures  also  require 
the  collection  and  extraction  of  pore  water  under  completely  anoxic  conditions  (to  pre¬ 
vent  precipitation  of  Fe(II)).  Failure  to  rigorously  maintain  these  requirements  results  in 
imprecise  results,  in  addition  to  inaccuracies. 

18.6.2  Ecological  relevance 

With  respect  to  the  evaluation  of  chemistry  measures,  ecological  relevance  was  evaluated 
in  terms  of  linkages  to  receptors  that  are  to  be  protected.  Whole  sediment  chemistry,  elu¬ 
triates,  and  reference  elements  were  rated  low  since  these  approaches  are  not  based  on 
measures  of  bioavailability  or  are  not  direct  measures  of  ecological  relevance.  Total  or¬ 
ganic  carbon,  SEM/AVS,  metal  speciation,  and  porewater  measures  were  rated  with  a 
moderate  level  of  uncertainty  since  these  measures  are  based  on  evaluating  the 
bioavailable  faction  of  a  chemical  in  sediment.  All  of  the  chemistry  measures  listed  in 
Table  18-6  are  most  useful  when  used  in  combination  with  approaches  that  directly  mea¬ 
sure  biological  effects  (Sections  18.2  and  18.3). 

18.6.3  Causality 

Determination  of  causality  (i.e.,  correctly  identifying  stressors)  can  be  of  fundamental 
importance  in  SERAs.  For  this  reason,  uncertainty  in  each  of  the  chemical  measurements 
was  evaluated  in  terms  of  their  ability  to  determine  specific  linkage  to  a  contaminant  of 
concern  or  to  sources  of  chemical  contaminants.  Low  uncertainty  was  assigned  to  all  of 
the  measures  of  sediment  chemistry,  except  those  which  determined  chemical  concentra- 
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Table  18-6  Uncertainty  associated  with  sediment  chemistry  measurements 


Evaluation  criteria 

Bulk 

sediment 

TOC  norm¬ 
alization 

SEM  metals 
&AVS 

Metal 

speciation 

(nonAVS) 

Pore 

water 

Elutriate 

Reference 

element 

Precision 

1 

1 

1 

2* 

2* 

1 

1 

Ecological  relevance 

3 

2 

2 

2 

2 

3 

3 

Causality: 

1 

1 

1 

1 

2 

3 

1 

contaminant 

Causality:  source 

2* 

2 

2 

2 

2 

3 

1 

Sensitivity 

1* 

1* 

1* 

1* 

1* 

1* 

1* 

Interference 

2* 

2* 

2* 

2* 

2* 

2* 

2* 

Standardization 

1* 

1* 

1* 

3* 

2* 

2* 

1* 

Discrimination 

1 

1 

1 

2* 

1 

1 

1 

Bioavailability 

2* 

1 

1* 

2* 

2* 

3 

2 

Field  validation^ 

1 

2 

2* 

2* 

3 

3 

1 

1  =  low  uncertainty  (good) 

3  =  high  (bad) 

*  =  lack  of  knowledge 
a  =  not  related  to  field  sampling 


tions  in  sediment  elutriates.  The  elutriate  procedure  provides  a  measure  of  the  contami¬ 
nants  that  are  released  during  the  suspension  of  sediment  in  water.  Therefore,  prepara¬ 
tion  of  elutriates  alters  the  sediment  sample,  increasing  the  uncertainty  in  the  sediment 
contaminant  concentration.  For  this  reason,  elutriate  samples  do  not  provide  a  direct 
measure  of  the  chemical  characteristics  of  sediments  in  situ.  Although  porewater  concen¬ 
trations  provide  more  direct  linkages  to  whole  sediment  chemistry,  the  procedures  used 
to  isolate  pore  water  may  also  introduce  uncertainty  (USEPA  1994a;  ASTM  1995b). 

Identification  of  the  sources  of  particular  classes  of  chemical  is  central  to  the  SERA  pro¬ 
cess.  In  this  respect,  whole  sediment  chemistry  and  reference  element-based  procedures 
were  considered  to  provide  useful  measures  for  evaluating  contaminant  sources,  particu¬ 
larly  for  certain  classes  of  organics  {e.g.,  PAHs)  and  for  metals  (Schropp  etal  1990;  Envi¬ 
ronment  Canada  1995).  In  contrast,  elutriate  chemistry  provides  limited  information 
regarding  the  chemical  composition  of  sediments  in  situ  or  contaminant  sources  (ASTM 
1995d). 

18.6.4  Sensitivity 

Sensitivity  is  an  important  criterion  for  evaluating  the  applicability  of  various  measure¬ 
ments  of  sediment  chemistry  in  SERAs  because  there  is  a  need  to  reliably  identify  sedi¬ 
ments  with  high,  moderate,  and  low  contaminant  concentrations.  Most  analytical 
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methods  for  determining  chemical  concentrations  in  sediments  are  very  sensitive.  How¬ 
ever,  systematic  insensitivities  (false  negatives)  and  high  variability  may  occur  in  interpre¬ 
tation  of  metal  contamination  in  whole  sediments  if  samples  are  collected  from  a  range 
of  particle  sizes.  Metal  concentrations  are  surface-area  dependent  and  in  general  are 
lower  in  sandy  sediments  (Schropp  etal  1990).  Use  of  reference  element  normalization, 
sieving,  or  a  combination  of  sieving  and  reference  element  normalization  can  greatly  re¬ 
duce  this  uncertainty  (Loring  1991).  TOC-normalization  may  also  help  reduce  the  possi¬ 
bility  of  false  negatives  regarding  organic  contamination  in  instances  where  sediments 
are  low  in  TOC  (Di  Toro  et  al  1991).  SEM/AVS  and  other  metal  speciation  procedures 
can  reduce  uncertainties  about  effects  of  natural  processes  on  metal  concentrations  (Di 
Toro  etal  1992).  For  example,  both  Cd  and  Ag naturally  accumulate  to  high  concentra¬ 
tions  in  sulfide-rich  deposits  which  occur  in  unpolluted  conditions  (e.g.,  deep  oceans). 
Similarly,  elevated  levels  of  petroleum  hydrocarbons  can  occur  near  natural  oil  seeps,  and 
dioxin  can  occur  near  natural  combustion  sources  (e.g.,  forest  fires).  Recognizing  natu¬ 
rally  high  concentrations  of  sulfide,  iron,  or  organic  materials  could  prevent  conclusions 
that  serious  contamination  sources  are  present  when  natural  processes  are  responsible 
for  contaminant  concentrations. 

18.6.5  Interferences 

With  respect  to  chemical  measurements,  interferences  are  considered  to  be  factors  which 
impair  accurate  determination  or  interpretation  of  the  concentrations  of  contaminants 
in  sediment  samples.  In  most  cases,  interferences  are  related  to  sample  matrix  problems 
and  are  analyte  specific  in  the  categories  listed  in  Table  18-6.  Interpretation  interferences 
include  particle  size  variability  and  anomalous  high  concentrations  of  natural  sediment 
components  that  equilibrate  with  high  concentrations  of  contaminants.  Although  rarely 
quantified,  hydrodynamic  and  hydrologic  processes  can  also  interfere  with  interpretation 
of  the  methods  listed  in  Table  18-6.  For  example,  PAHs  from  local  sources  occur  in  lower 
concentrations  relative  to  TOC  in  San  Francisco  Bay,  where  sedimentation  rates  are  the 
highest.  High  inputs  of  PAH-poor  sediments  dilute  concentrations  of  the  contaminants 
in  bedded  sediments  compared  to  areas  where  deposition  of  the  PAH-poor  sediments  is 
lower  (W.  Pereira,  USGS,  unpublished  data).  Seasonal  variations  in  TOC  and  Fe  oxides 
also  appear  to  drive  seasonal  variations  in  metal  concentrations  in  surficial  San  Francisco 
Bay  sediments  (Luoma,  1990). 

In  the  Clark  Fork  River  and  other  mining-contaminated  rivers,  fluctuations  in  the  inten¬ 
sity  of  metal  desorption  from  sediments  is  biologically  driven.  Inputs  of  contaminated 
materials  in  these  systems  from  terrigenous  sources  caused  two-  to  four-fold  seasonal 
variations  in  sediment  metal  concentrations  (Brumbaugh  etal  1994).  Interferences  also 
add  important  uncertainties  to  the  determination  of  metal  species  in  oxidized  sediments. 
Many  of  the  standard  extraction  techniques  are  not  directly  selective  for  specific  metal 
forms,  although  valuable  interpretations  are  possible  if  specialized  techniques  are  used 
(e.g.,  determination  of  both  total  and  methyl  Hg). 
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18.6.6  Standardization 

Standard  methods  have  been  developed  for  virtually  all  of  the  analytical  procedures  con¬ 
sidered  in  this  assessment  (i.e.,  by  ASTM,  USEPA,  OECD,  Environment  Canada).  How¬ 
ever,  there  are  still  few  methods  available  that  can  effectively  speciate  metals  and 
metalloids  in  oxidized  sediments  or  that  can  be  used  to  measure  nonpriority  pollutants. 
As  metal  speciation  is  an  important  determinant  of  metal  toxicity,  more  research  in  this 
area  is  needed  to  reduce  uncertainty  in  SERAs.  In  addition,  the  standard  methods  for 
certain  substances  need  to  be  refined  to  achieve  biologically  relevant  detection  limits. 

18.6.7  Discrimination 

In  the  previous  applications,  this  criterion  was  intended  to  evaluate  the  ability  of  a  mea¬ 
surement  to  discriminate  between  contaminated  sediments  by  providing  a  graded  re¬ 
sponse.  In  this  evaluation,  a  chemical  measure  was  considered  to  have  a  higher  degree  of 
uncertainty  if  it  did  not  provide  a  means  of  distinguishing  between  samples  which  had 
different  levels  of  contamination.  Analytical  methods  are  very  good  discriminators.  How¬ 
ever,  the  interpretational  uncertainties  described  above  for  whole  sediments  add  substan¬ 
tial  uncertainties  to  discrimination  of  contamination  using  this  method. 

18.6.8  Bioavaiiability 

For  evaluating  measures  of  sediment  chemistry,  procedures  were  considered  to  have  a 
lower  level  of  uncertainty  if  they  incorporated  factors  which  are  considered  to  influence 
the  bioavailability  of  sediment-associated  contaminants.  In  general,  such  factors  are  in¬ 
corporated  through  the  use  of  various  normalization  procedures.  Although  bulk  con¬ 
taminant  concentrations  do  not  explicitly  predict  the  bioavailable  fraction,  they  have 
been  shown  in  some  cases  to  be  generally  related  to  biological  responses  (Long  et  al 
1995;  Ingersoll  etal  1996).  The  relations  are  probably  most  common  across  sediments 
where  contaminants  occur  in  a  similar  geochemical  form  and  particle  size  biases  are 
small.  The  TOC-  and  AVS-normalization  procedures  often  reduce  the  level  of  uncertainty 
about  the  bioavailability  of  nonionic  organics  and  metals  respectively  (Di  Toro  et  al  1990, 
1991).  Iron-  and  TOC-normalization  procedures  also  have  been  used  to  predict  the  bio¬ 
availability  of  metals  in  some  estuarine  and  riverine  habitats  (Luoma  and  Bryan,  1978, 
1982;  Tessier  et  al  1984).  Exceptions  to  the  successful  use  of  normalization  to  predict  bio¬ 
availability  occur  in  the  literature,  resulting  in  some  uncertainty  about  how  broadly 
these  approaches  can  be  applied  to  dynamic  natural  systems  (USEPA  1990a,  1996b). 
1991, 1992, 1995).  Elutriate  preparation  tends  to  alter  bioavailability  in  unpredictable 
ways  and,  therefore,  increases  uncertainty. 

18.6.9  Field  validation 

Field  validation  is  an  essential  element  of  the  SERA  process.  With  respect  to  the  evalua- 
tiofi  of  sediment  chemistry  measures,  field  validation  was  interpreted  in  terms  of  the  ac¬ 
curacy  of  the  method.  That  is,  the  uncertainty  about  the  extent  to  which  measurements 
of  sediment  chemistry  reflect  actual  field  concentrations  of  contaminants  was  evaluated. 
Whole  sediment  chemistry  and  reference  element  concentrations  have  low  uncertainty 
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with  respect  to  accuracy  because  these  methods  have  well-established  QA/QC  proce¬ 
dures,  and  because  they  are  applied  to  samples  collected  in  the  field,  uncertainty  about 
their  field  relevance  is  low.  However,  a  number  of  uncertainties  are  associated  with  the 
analysis  of  inorganics  (z.c.,  AVS  or  metal  speciation;  USEPA  1990a,  1992, 1995)  and  with 
elutriates.  These  uncertainties  stem  from  alterations  of  the  sediments  which  organisms 
are  exposed  to  in  situ,  resulting  from  sample  collection,  storage,  laboratory  treatment,  or 
other  methodological  procedures. 

13.6.10  Summary 

Depending  on  the  objectives  of  the  study,  most  of  the  methods  examined  in  Table  18-6 
represent  useful  measurements  for  conducting  SERAs.  However,  the  high  level  of  uncer¬ 
tainty  associated  with  elutriate  chemistry  makes  interpretation  of  these  data  difficult. 
This  concern  is  particularly  associated  to  ecological  relevance,  causality,  bioavailability, 
and  field  validation. 

18.7  Evaluation  of  uncertainty  associated  with  use  of 
sediment  quality  criteria  and  guidelines 

Numerical  SQGs  are  useful  tools  for  conducting  ERAs  and  HHRAs.  Specifically,  SQGs  for 
the  protection  of  aquatic  life  provide  a  basis  for  screening  sediment  chemistry  data  to 
evaluate  the  potential  for  adversely  affecting  benthic  organisms.  Similarly,  residue-based 
SQGs  provide  a  means  of  determining  if  sediment-associated  contaminants  pose  a  poten¬ 
tial  risk  to  human  health  or  wildlife  (z>.,  fish,  birds).  Although  SQGs  are  widely  used  in 
sediment  quality  assessments,  the  nature  and  extent  of  the  uncertainty  associated  with 
these  measures  are  often  overlooked.  Nonetheless,  this  uncertainty  can  be  substantial 
and  has  the  potential  to  influence  the  results  of  the  risk  assessment.  For  this  reason,  a 
comparative  assessment  of  the  relative  uncertainty  associated  with  selected  SQGs  and 
related  measurement  endpoints  was  conducted  (Table  18-7).  Recommendations  for  re¬ 
ducing  the  uncertainty  associated  with  these  tools  were  also  identified. 

Sediment  quality  guidelines  are  developed  both  for  the  protection  of  aquatic  organisms 
and  for  the  protection  of  human  health  and  wildlife.  A  variety  of  different  approaches 
have  been  used  to  derive  numerical  SQGs  for  the  protection  of  aquatic  organisms;  how¬ 
ever,  all  of  these  approaches  fall  into  three  primary  categories:  equilibrium  partitioning, 
co-occurrence  (z.c.,  derived  primarily  from  field-collected  sediment  samples),  and  mod¬ 
els.  Only  the  following  SQGs  are  considered  in  Table  18-7  because  these  guidelines  have 
been  used  extensively  in  a  variety  of  sediment  assessment  applications: 

•  USEPA  sediment  quality  criteria  (Hansen  et  al  1993a-1993e).  To  date,  the  EqP 
approach  has  been  used  to  derive  draft  SQC  for  acenaphthene,  phenanthrene, 
fluoranthene,  dieldrin,  and  endrin.  However,  the  approach  is  amenable  to  many 
nonionic  organic  substances. 

•  Effects  range  low  and  effects  range  median  (Long  and  Morgan  1990;  Long  and 
MacDonald  1992;  Long  et  al.  1995;  Ingersoll  et  al.  1996).  These  informal  SQGs 
were  derived  from  an  information  system  containing  data  from  EqP  models, 
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Table  18-7  Uncertainty  associated  with  sediment  quality  guidelines 


Evaluation  criteria 

SQC 

ERL 

&ERM 

AET 

SLC 

SEM/AVS 

TU  models 

Residue  - 
based  SQG 

Precision 

1 

2* 

3 

3 

2* 

3* 

2 

Ecological  relevance 

2* 

1 

2 

3 

1 

1 

2*(b) 

Causality 

1 

3*(a) 

3 

3 

1 

1* 

2 

Sensitivity 

2 

l(c)/2 

3 

1 

2* 

2 

1 

Interference  (d) 

2 

2* 

2 

3 

2* 

2 

1 

Standardization 

1 

1 

2 

2 

1 

3 

1 

Discrimination  (e) 

1 

1 

3 

3 

2 

2 

1 

Bioavailability 

1 

2*(e) 

2*(e) 

1 

1 

2* 

1* 

Field  validation 

2* 

2* 

2* 

3* 

2* 

2 

2* 

1  =  low  uncertainty  (good) 

3  =  high  (bad) 

*=  lack  of  knowledge 

a  =  Uncertainty  can  be  reduced  through  use  of  TU  models,  which  help  to  identify  chemicals  most  strongly  associated  with 
observed  effects. 

b=  Few  compounds,  based  on  consumption  effects 
c  =  ERL 

d  =  Interferences  resulting  from  community  responses  and  mixture  effects 
e  =  With  normalization 


laboratory  spiked-sediment  toxicity  tests,  or  field  studies.  The  threshold  and 
probable  effect  levels  (TELs  and  PELs)  reported  by  MacDonald  (1994)  and 
MacDonald  et  al  (1996)  were  considered  to  be  functionally  similar  to  the  ERLs 
and  ERMs  and,  therefore,  were  not  explicitly  considered  in  this  evaluation. 

•  Washington  State’s  sediment  management  standards  (Washington  Department 
of  Ecology  1990).  These  standards  are  based  on  the  AETs  that  have  been 
developed  using  the  results  of  toxicity  tests  and  benthic  community  assessments 
(Barrick  et  al  1988). 

•  Screening-level  concentrations  (SLCs)  (Neff  et  1986;  Persaud  et  al  1992). 
These  SC^Gs  were  derived  using  matching  sediment  chemistry  and  benthic 
community  data  from  various  geographic  areas. 

•  SEM/AVS  (Di  Toro  et  al  1990, 1992;  USEPA 1995).  Using  this  procedure,  metals 
are  not  considered  to  contribute  to  any  observed  toxicity  when  their  molar 
concentration  is  less  than  that  of  AVS. 

•  Toxic  units  models  (Chapter  16).  Toxic  units  models  typically  assume  that  the 
effects  of  contaminants  are  additive  and  are  based  on  comparisons  of  chemical 
concentrations  to  toxicity  benchmarks  ie.g.,  LC50s,  SQGs;  Swartz  et  al  1995). 
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Toxicity  may  be  predicted  when  the  sum  of  the  ratios  calculated  for  each 
substance  exceeds  one. 

•  Residue-based  SQGs  (Cook  et  al.  1992).  These  SQGs  are  derived  by  determining 
the  chemical  concentrations  in  sediments  which  are  predicted  to  result  in 
acceptable  tissue  residues.  Using  this  approach,  appropriate  tissue  residue 
guidelines  (such  as  FDA  action  levels  for  the  protection  of  human  health 
[USEPA  1989a]  or  the  Niagara  River  fish  flesh  criteria  for  the  protection  of 
piscivorous  wildlife  [Newell  et  al  1987])  are  used  in  conjunction  with  biota-to- 
sediment  accumulation  factors  (BSAFs)  to  derive  the  S(XG. 

As  all  of  the  approaches  listed  above  have  been  reviewed  and  summarized  elsewhere, 
they  have  not  been  described  in  this  document.  Instead,  the  reader  is  directed  to 
Chapman  (1989),  Beak  Consultants  Ltd.  (1987, 1988),  USEPA  (1989b,  1990a,  1996b). 
1992, 1993, 1995),  MacDonald  etal  (1992),  Persaud  etal  (1992),  and  MacDonald  (1995) 
for  additional  information  on  these  approaches. 

The  available  information  does  not  support  a  quantitative  evaluation  of  the  uncertainty 
associated  with  each  of  the  SQGs  considered  in  Table  18-7.  For  this  reason,  an  evaluation 
of  the  relative  uncertainty  of  each  approach  was  conducted  to  provide  guidance  on  the 
selection  of  screening  tools.  The  evaluation  of  uncertainty  in  Table  18-7  was  conducted 
using  the  same  general  criteria  which  were  used  in  the  evaluation  of  the  other  measure¬ 
ment  endpoints  in  Tables  18-1  to  18-3  and  Tables  18-5  and  18-6.  However,  these  criteria 
were  refined  to  increase  their  applicability  to  SQGs. 

18.7.1  Precision 

Precision  was  evaluated  in  terms  of  the  robustness  of  the  guidelines  derivation  procedure. 
That  is,  procedures  which  generate  similar  guidelines  using  different  datasets  {i.e.,  from 
different  geographic  areas)  were  considered  to  have  a  lower  level  of  uncertainty  than 
those  which  generate  different  guidelines  using  similar  datasets.  Precision  provides  a 
measure  of  the  broad  applicability  of  the  SQGs.  In  terms  of  precision,  the  lowest  level  of 
uncertainty  was  assigned  to  the  SQC  because  of  the  extensive  toxicology  database  on 
which  they  were  derived.  High  uncertainty  was  assigned  to  AETs  and  SLCs  because  of  the 
site-specificity  associated  with  their  derivation.  Although  a  moderate  level  of  uncertainty 
was  assigned  to  the  ERLs  and  ERMs,  advances  in  the  development  of  the  databases  upon 
which  they  depend  will  hopefully  reduce  this  uncertainty  in  the  near  future.  A  moderate 
level  of  uncertainty  was  also  assigned  to  the  SEM/AVS-based  guidelines  because  of  the 
micro-spatial  distribution  of  AVS,  particularly  in  relation  to  biogenic  structures.  It  is 
important  to  note  that  the  mixture  models  (e.g.,  Swartz  et  al  1995;  Chapter  16)  do  not 
have  an  inherent  high  level  of  uncertainty,  but  instead  the  methods  are  not  yet  fully  de¬ 
veloped.  Hence,  it  is  not  yet  possible  to  determine  the  uncertainty  associated  mixture 
models  as  they  apply  to  assessment  of  sediments. 

18.7.2  Ecological  relevance 

With  respect  to  the  evaluation  of  SQGs,  ecological  relevance  was  evaluated  in  terms  of  its 
linkage  to  the  receptors  that  are  to  be  protected.  Therefore,  SQGs  that  are  based  on  data 
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collected  in  the  field  or  in  the  laboratory  on  appropriate  species  were  considered  to  have 
less  uncertainty  than  those  that  are  based  primarily  on  modeling.  The  effects  of  mixtures 
of  contaminants  in  sediments  is  critically  important  to  evaluations  of  ecological  rel¬ 
evance.  Guidelines  which  directly  consider  mixtures  were  assigned  a  relatively  low  level 
of  uncertainty  {e.g.,  mixture  models,  AVS/SEM  guidelines,  and  the  ERL/ERM  guidelines 
derived  using  data  from  the  field  which  included  contaminant  mixtures).  Existing  SQC 
do  not  consider  mixtures  and,  hence,  were  assigned  a  moderate  level  of  uncertainty.  Simi¬ 
larly,  AETs  were  assigned  a  moderate  level  of  uncertainty  because  of  their  inherent  poten¬ 
tial  for  incorrectly  identifying  toxic  samples  as  not  toxic  {i.e.,  false  negatives).  The  SLCs 
reflect  the  lower  bound  of  ecologically  relevant  sediment  concentrations  (z>.,  background 
concentrations)  but  may  not  necessarily  define  actual  effect  concentrations  {i.e.,  false 
positives,  nontoxic  samples  identified  as  toxic).  Although  the  tissue  residue  guidelines 
with  which  the  residue-based  SQGs  were  derived  are  considered  to  be  highly  ecologically 
relevant,  more  uncertainty  is  associated  with  the  models  which  are  used  to  determine  the 
BSAFs.  Therefore,  a  moderate  level  of  uncertainty  was  assigned  to  residue-based  SQGs. 

18.7.3  Causality 

Determination  of  causality  {i.e.,  correctly  identifying  stressors)  is  of  fundamental  impor¬ 
tance  in  SERAs.  For  this  reason,  uncertainty  in  the  SQGs  was  also  evaluated  in  terms  of 
their  ability  to  determine  or  predict  the  factors  causing  adverse  effects  in  contaminated 
sediments.  That  is,  a  lower  level  of  uncertainty  was  assigned  to  SQGs  that  directly  iden¬ 
tified  stressors.  The  SQC,  SEM/AVS,  and  mixture  models  were  assigned  low  uncertainty 
because  they  are  directly  derived  from  experimental  determinations  of  effects  of  specific 
chemicals.  In  contrast,  ERLs  and  ERMs,  AETs,  and  SLCs  were  assigned  higher  levels  of 
uncertainty  because  these  guidelines  are  derived  primarily  from  field  observations  in 
which  cause-and-effect  relations  were  equivocal  {i.e.,  the  sediments  contained  mixtures  of 
contaminants  and,  hence,  it  is  difficult  to  determine  the  causative  agents  directly). 

18.7.4  Sensitivity 

Sensitivity  is  an  important  characteristic  of  a  measurement  because  there  is  a  need  to 
reliably  identify  sediments  with  high,  moderate,  and  low  probabilities  of  causing  adverse 
biological  effects.  Therefore,  SQGs  were  considered  to  have  a  relatively  low  level  of  uncer¬ 
tainty  in  this  respect  if  they  reliably  estimated  effects  at  relatively  low  contaminant  con¬ 
centrations  {i.e.,  minimize  false  negatives  while  allowing  for  a  higher  probability  of  false 
positives;  Long  et  al  1995;  Ingersoll  et  al.  1996;  MacDonald  et  al.  1996).  The  need  for 
sensitivity  {e.g.,  minimize  false  negatives)  should  be  balanced  with  ecological  relevance 
{e.g.,  minimize  both  false  positives  and  false  negatives  (Section  18.7.2)).  Low  uncertainty 
with  respect  to  sensitivity  was  assigned  to  the  ERLs  and  SLCs  because  they  tend  to  be  the 
lowest  SQGs.  Most  of  the  other  SQGs  were  considered  to  have  a  relatively  higher  level  of 
uncertainty  because  they  are  generally  higher  values  {e.g.,  SQC,  ERMs,  and  SEM/AVS). 
The  AETs  were  assigned  a  high  level  of  uncertainty  with  respect  to  sensitivity  because 
they  only  increase  with  the  addition  of  new  data,  making  them  particularly  prone  to  false 
negatives.  In  contrast,  the  residue-based  SQGs  were  considered  to  have  a  lower  level  of 
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uncertainty  because  the  tissue  residue  guidelines  are  based  on  the  results  of  chronic  tox¬ 
icity  tests  on  sensitive  species. 

18.7.5  Interferences 

In  the  context  of  the  SQG  evaluation,  interferences  are  considered  to  be  related  to  biotic 
or  abiotic  factors  that  could  influence  the  SQGs  derivation  beyond  the  direct  effects  of 
specific  contaminants.  That  is,  interferences  are  considered  to  be  nontreatment  factors. 
Benthic  communities  can  be  strongly  influenced  by  environmental  factors  such  as  grain 
size,  TOC,  or  salinity.  These  same  factors  can  also  influence  the  results  of  some  sediment 
toxicity  tests;  however,  careful  experimental  design  can  minimize  these  effects  (DeWitt  et 
ah  1988).  Because  the  SLCs  are  based  entirely  on  benthic  community  data,  they  were 
considered  to  have  the  highest  level  of  uncertainty.  Because  the  residue-based  guidelines 
are  from  direct  analytical  determination,  they  are  not  subject  to  the  same  types  of  inter¬ 
ferences.  Hence,  a  relatively  low  levels  of  uncertainty  were  assigned  to  these  measurement 
endpoints.  The  influence  of  sediment  characteristics  such  as  surface  area,  mineral  inter¬ 
actions,  and  redox  are  not  typically  quantified  and  may  add  an  additional  amount  of 
uncertainty  to  each  of  the  SQGs  listed  in  Table  18-7. 

18.7.6  Standardization 

Standard  procedures  have  been  established  for  deriving  virtually  all  of  the  SQG  values 
listed  in  Table  18-7.  In  this  context,  uncertainty  in  the  degree  of  standardization  of  the 
approach  was  evaluated  on  the  basis  of  appropriate  peer  review.  That  is,  the  degree  of 
uncertainty  was  considered  to  be  lower  for  those  approaches  that  have  been  published  in 
the  peer-reviewed  literature.  Approaches  for  determination  of  SQC,  ERLs  and  ERMs,  and 
SEM/AVS  have  been  published  in  the  peer-reviewed  literature  and,  hence,  were  assigned 
a  low  degree  of  uncertainty.  In  contrast,  the  mixture  models  (in  the  early  stages  of  devel¬ 
opment  with  sediments),  tissue  residue  guidelines,  and  AETs  have  not  been  widely  peer 
reviewed  in  the  literature. 

18.7.7  Discrimination 

In  the  previous  sections,  this  criterion  was  intended  to  evaluate  the  ability  of  a  measure¬ 
ment  endpoint  to  discriminate  between  contaminated  sediments  by  providing  a  graded 
response.  That  is,  a  measure  was  considered  to  have  a  higher  degree  of  uncertainty  if  it 
did  not  provide  a  means  of  distinguishing  between  samples  that  had  different  levels  of 
contamination.  In  the  context  of  this  evaluation,  SQGs  were  considered  to  be  discrimina¬ 
tory  if  they  could  be  used  to  correctly  classify  toxic  and  nontoxic  samples  (Long  et  al 
1995).  The  SQC  and  the  ERLs  and  ERMs  have  been  demonstrated  to  provide  accurate 
measures  for  correctly  predicting  toxic  and  nontoxic  responses  in  the  field.  For  this  rea¬ 
son,  a  low  level  of  uncertainty  was  assigned  to  these  guidelines.  Although  SLCs  are  de¬ 
signed  to  be  highly  protective,  they  have  a  poor  ability  to  discriminate  the  range  of 
adverse  effects  that  could  occur.  In  contrast,  sediment  samples  with  contaminant  concen¬ 
trations  that  exceed  the  AETs  have  a  high  probability  of  being  toxic.  However,  the  AETs 
may  not  reliably  discriminate  samples  with  lower  levels  of  contamination  with  respect  to 
their  potential  for  adverse  biological  effects  {i.e.,  false  negatives).  The  residue-based 
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guidelines  are  based  on  accurate  analytical  measurements;  therefore,  tissue  residues  pro¬ 
vide  a  reliable  basis  for  discriminating  between  the  chemicals  that  could  be  causing  ad¬ 
verse  effects.  However,  these  guidelines  are  available  for  only  a  limited  number  of 
substances.  It  is  recommended  that  residue-based  guidelines  be  derived  for  additional 
priority  bioaccumulative  substances.  Additional  studies  should  evaluate  the  ability  of  the 
SQGs  listed  in  Table  18-7  to  minimize  both  false  negative  and  false  positive  errors. 

18.7.8  Bioavailabilily 

Information  from  a  variety  of  sources  indicates  that  the  bioavailability  and,  hence,  toxic¬ 
ity  of  sediment-associated  contaminants  can  be  influenced  by  the  characteristics  of  the 
sediments  and  the  associated  pore  water  (Pavlou  1987;  Swartz  et  al  1990;  Di  Toro  et  al 
1991, 1992).  For  this  reason,  SQGs  that  explicitly  considered  the  influence  of  these  factors 
were  deemed  to  have  a  lower  level  of  uncertainty  than  those  that  did  not  address  bioavail¬ 
ability  (Di  Toro  etal  1991).  The  factors  which  are  considered  to  influence  bioavailability 
are  directly  considered  in  the  derivation  of  the  SQC,  SLCs,  SEM/AVS,  and  residue-based 
guidelines.  Although  other  guidelines  {i.e.,  ERLs,  ERMs,  AETs,  and  mixture  models)  are 
largely  based  on  dry-weight  concentrations,  it  is  possible  to  refine  the  approaches  to  ex¬ 
plicitly  consider  other  normalization  procedures  (Ingersoll  et  al  1996).  Such  research  is 
in  progress  and  should  reduce  the  uncertainty  associated  with  these  methods  in  the  near 
future. 

18.7.9  Field  validation 

Field  validation  is  an  essential  element  of  the  SQGs  development  process  because  it  pro¬ 
vides  a  basis  for  assessing  their  reliability  in  sediment  quality  assessments.  For  this  rea¬ 
son,  a  higher  degree  of  uncertainty  in  the  application  of  SQGs  to  SERAs  was  assigned  to 
values  if  they  had  not  been  appropriately  field  validated.  At  a  minimum,  such  a  field  vali¬ 
dation  would  require  an  assessment  of  the  predictability  of  the  SQGs  using  a  number  of 
independent  datasets  {i.e.,  not  used  to  derive  the  SQGs;  Long  et  al  1995;  Ingersoll  et  al 
1996;  MacDonald  et  al  1996).  All  SQGs  suffer  from  a  lack  of  comprehensive  field  valida¬ 
tion.  Although  preliminary  evaluations  of  several  of  the  guidelines  have  been  conducted 
{e.g.,  SQC,  ERL,  ERMs,  AETs,  and  SEM/AVS),  the  results  apply  to  a  limited  number  of 
geographic  areas  and  sediment  quality  conditions.  It  is  recommended  that  SQGs  be  fur¬ 
ther  evaluated  using  the  results  of  field  studies  conducted  in  various  geographic  regions. 
This  is  the  most  important  general  research  need  for  reducing  the  uncertainty  associated 
with  the  guidelines. 

18.7.10  Summary 

The  results  of  this  evaluation  indicate  that  there  is  sufficient  certainty  associated  with 
SQGs  to  recommend  their  use  in  SERAs.  In  particular,  SQC,  ERLs,  ERMs,  SEM/AVS,  and 
residue-based  SQGs  generally  have  less  uncertainty  in  their  present  applications  than 
other  guidelines.  Although  mixture  models  were  generally  considered  to  have  somewhat 
higher  levels  of  uncertainty,  they  address  the  critically  important  issue  of  the  interaction 
of  contaminants  in  complex  mixtures.  For  this  reason,  it  is  strongly  recommended  that 
further  development  of  SQGs  encompassing  mixture  models  be  pursued.  In  spite  of  the 
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uncertainty  associated  with  the  general  application  of  AETs,  their  regional  application 
has  proven  to  be  effective.  Screening-level  concentrations  accurately  define  the  lower 
bound  for  sediment-associated  contaminants;  however,  they  should  not  be  used  in 
SERAs  because  they  cannot  discriminate  between  environmentally  relevant  contaminant 
concentrations.  Overall,  TIE  methods  offer  a  logical  solution  for  bridging  the  gap  be¬ 
tween  effects  and  exposure  assessments  and  would  result  in  a  more  accurate  and  compre¬ 
hensive  characterization  of  risk  (Chapter  16).  A  concern  expressed  with  current  risk 
assessment  frameworks  was  that  there  does  not  appear  to  be  any  type  of  defined  “box” 
(Chapter  1)  for  mixture  models  or  TIEs  described  in  Chapter  16,  which  could  greatly 
advance  the  science  of  (at  least  retrospective)  SERAs. 

Further  investigations  are  needed  to  reduce  uncertainty  in  SQGs  and  increase  their  appli¬ 
cability  in  SERAs.  For  example,  additional  studies  are  required  to  identify  factors  which 
control  the  bioavailability  of  sediment-associated  contaminants  under  various  condi¬ 
tions.  In  turn,  this  information  could  be  used  to  refine  the  SQGs  for  specific  applications. 
In  addition,  residue-based  SQGs  should  be  derived  for  a  number  of  substances  that  are 
known  to  bioaccumulate.  Furthermore,  all  of  the  SQGs  should  be  field  validated  to  deter¬ 
mine  their  predictive  ability  and  their  potential  for  resulting  in  false  positive  and  false 
negative  errors. 

18.8  Conclusions  and  recommendations 

18.8.1  Improving  sediment  ecological  risk  assessments  to  quantify  and 
account  for  uncertainty 

•  While  each  measurement  tool  has  an  inherent  level  of  uncertainty  associated 
with  its  application,  the  uncertainty  associated  with  the  overall  risk  assessment 
process  can  be  reduced  by  integrating  these  tools.  For  example,  the  use  of 
sediment  chemistry,  sediment  toxicity,  and  benthic  community  data  together  in 
a  sediment  quality  triad  can  be  used  to  establish  a  weight  of  evidence  linking 
contaminated  sediments  to  adverse  biological  effects  (Chapman  etal  1992). 
Similarly,  Long  et  al.  (1995)  used  sediment  chemistry  and  toxicity  data,  in 
conjunction  with  SQGs,  to  evaluate  sediment  quality  and  identify  contaminants 
that  were  likely  causing  adverse  biological  effects.  Therefore,  the  integration  of 
multiple  tools  has  the  potential  to  substantially  reduce  uncertainty  in  SERAs 
and  improve  management  decisions. 

•  Toxicity  tests  are  generic  indicators  of  stress  and  cannot,  by  themselves, 
discriminate  specific  factors  which  caused  the  response.  Linkage  of  results  of 
toxicity  tests  to  a  list  of  chemicals  in  the  sediment  sample  might  prove  to  be  of 
limited  use  for  defining  potential  causes  of  toxicity.  Chemicals  responsible  for 
toxicity  may  not  have  been  measured,  the  bioavailability  of  chemicals  can  be 
quite  uncertain,  and  correlative  techniques  are  often  unable  to  deal  with 
multiple  contributions  from  complex  mixtures. 

•  Although  TIES  and  mixture  models  were  generally  considered  to  have  somewhat 
higher  levels  of  uncertainty,  both  approaches  address  the  critically  important 
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issue  of  the  interaction  of  contaminants  in  complex  mixtures.  For  this  reason, 
further  development  of  these  approaches  should  be  pursued.  Overall,  TIE 
methods  offer  a  logical  solution  for  bridging  the  gap  between  effects  and 
exposure  assessments  and  would  result  in  a  more  accurate  and  comprehensive 
characterization  of  risk. 

•  The  most  ecologically  relevant  phases  of  sediment  used  in  toxicity  tests  from  the 
standpoint  of  uncertainty  were  whole  sediment  tests  with  benthic  organisms, 
followed  by  whole  sediment  tests  with  pelagic  species,  suspended  sediment 
tests,  and  porewater  tests.  In  the  case  of  the  organic  extract  and  elutriate  tests, 
the  exposure  regimes  do  not  represent  a  realistic  measure  of  sediment  in  situ 
conditions. 

•  The  uncertainty  associated  with  survival  as  the  endpoint  monitored  in  labora¬ 
tory  toxicity  tests  is  less  than  that  for  the  other  endpoints.  The  continued 
development  of  more  sensitive  and  ecologically  relevant  endpoints  {e.g.,  chronic 
effects  on  growth  and  reproduction,  life-cycle  tables)  has  the  potential  to 
produce  superior  measurement  endpoints  for  use  in  SERAs. 

•  Insufficient  information  exists  on  how  to  relate  the  response  of  endpoints  at  the 
level  of  the  individual  to  the  population  or  the  community.  However,  commu¬ 
nity-level  assessments  are  more  likely  to  include  sediment  effects  as  the  number 
of  species  examined  increases  the  probability  of  observing  a  sediment-associ¬ 
ated  response.  Therefore,  relationships  should  be  examined  between  responses 
measured  using  biomarkers  and  individuals  and  responses  measured  using 
populations  and  communities. 

•  Bioaccumulation  cannot  alone  provide  information  about  whether  the  source  of 
exposure  was  water,  sediment,  or  pore  water  and  cannot  alone  be  used  to 
evaluate  effects.  Combinations  of  laboratory  studies,  field  data  on  water  and 
sediment  concentrations,  and  food  webs  and  pathway  models  can  be  used  to 
increase  our  understanding  of  pathways  of  exposure  and  effects. 

•  An  inherent  uncertainty  exists  in  quantitatively  attempting  to  relate  laboratory 
bioaccumulation  results  to  the  field.  Nonetheless,  studies  have  successfully 
demonstrated  bioaccumulation  results  obtained  in  the  laboratory  with  field- 
collected  sediments  can  be  highly  predictive  of  residue  concentrations  observed 
in  synoptic  collection  of  organisms  from  the  field.  Combined  laboratory  and 
model  studies  show  great  potential  for  reducing  uncertainties  in  extrapolating 
from  controlled  studies  to  nature. 
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18.8.2  Strategic  actions  for  addressing  uncertainty  in  sediment  ecological 
risk  assessments 

Top  five  recommendations: 

•  Evaluate  the  ability  of  laboratory  responses  to  predict  responses  in  the  field. 

•  Evaluate  the  predictive  ability  {i.e.,  field  validation)  of  SQGs  in  a  variety  of 
geographic  regions,  including  the  potential  for  false  positive  and  false  negative 
errors. 

•  Develop  methods  for  evaluating  chronic  effects  of  contaminated  sediments  on 
growth,  reproduction,  and  population  dynamics. 

•  Adapt  TIE  procedures  used  to  evaluate  aqueous  samples  such  as  effluents  or 
pore  water  for  use  in  whole  sediment  toxicity  tests. 

•  Develop  residue-based  SQGs  for  highly  bioaccumulative  substances. 

Additional  recommendations: 

•  Use  a  weight-of-evidence  approach  to  reduce  uncertainty  associated  with 
SERAs. 

•  Develop  procedures  for  integrating  uncertainty  estimates  for  each  of  the  three 
measures  {i.e.,  biology,  chemistry,  toxicity)  in  the  sediment  quality  triad. 

•  Use  consistent  methods  when  possible  to  conduct  SERAs. 

•  Identify  biomarkers  and  their  relation  to  ecologically  relevant  endpoints  and  to 
specific  contaminants. 

•  Improve  sampling  methods  to  lower  uncertainty  in  estimates  of  spatial  and 
temporal  variability. 

•  Further  evaluate  factors  controlling  the  bioavailability  of  contaminants  in 
sediments. 

•  Determine  the  mode  of  toxicity  of  contaminants. 

•  Determine  the  extent  to  which  natural  systems  are  at  equilibrium  or  steady  state 
to  determine  if  models  that  rely  on  this  assumption  are  relevant. 

•  Further  evaluate  relative  sensitivity  of  the  species  commonly  used  in  sediment 
toxicity  assessments. 


•  Refine  approaches  for  identifying  reference  conditions  for  benthic  communities. 

•  Evaluate  the  altered  sensitivity  of  populations  exposed  to  stressors. 

•  Better  define  natural  variability  of  key  receptors  and  stressors. 

•  Conduct  studies  to  determine  the  precision  of  benthic  community  assessment 
methods. 

•  Verify  models  of  the  interactions  of  contaminant  mixtures  in  relation  to  in  situ 
biological  and  toxicological  effects  of  contaminated  sediments. 


Consider  interferences  which  decrease  the  predictive  ability  of  laboratory 
toxicity  tests  {e.g.,  UV  light  and  photoinduced  toxicity  of  PAHs). 

Further  elucidate  the  life  history  of  sensitive  species. 
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•  Evaluate  relations  between  body  burdens  and  toxic  responses. 

•  Further  evaluate  effects  of  dietary  exposures  to  contaminants,  particularly  for 
metals  and  PAHs. 

•  Field-validate  tiered  sediment  quality  assessment  approaches  to  assure  that  they 
provide  relevant  management  information. 
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Ecological  risk  assessment  for  sediments: 

an  Australasian  perspective 

D./.  Morrisey 


19.1  Introduction 

Ecological  risk  assessment  as  a  formal  process  is  at  an  early  stage  of  development  in  Aus¬ 
tralia  and  New  Zealand.  Consequently,  this  discussion  is  preliminary  in  nature  and  much 
of  it  is  likely  to  be  overtaken  by  events.  The  following  sections  consider  1)  the  current 
state  of  ERA  as  a  formal  process  in  Australia  and  New  Zealand,  with  particular  reference 
to  ERA  for  sediments  (SERA);  2)  particular  problems  of  assessment  of  ecological  effects 
in  Australia  and  New  Zealand;  and  3)  the  appropriateness  of  a  formal  framework  for 
SERA  in  situations  where  there  is  little  information  on  local  conditions.  Since  problems 
of  assessment  of  exposure  (routes  of  uptake  and  mode  of  contact  of  organisms  with  con¬ 
taminants)  to  a  given  concentration  of  a  contaminant  are  probably  the  same  in 
Australasia  as  in  other  parts  of  the  world  (as  discussed  in  other  chapters),  this  discussion 
will  focus  on  aspects  where  the  perspective  is  different,  notably  problem  formulation,  as¬ 
sessment  of  effects,  and  risk  management.  Lack  of  information  on  local  ecological  sys¬ 
tems  and  their  components  and  the  consequent  problem  of  assessing  the 
appropriateness  of  using  data  from  studies  done  overseas  are  relevant  to  the  first  two  of 
these.  The  relatively  small  concentrations  of  contaminants  generally  present  in  sediments 
in  Australasian  aquatic  habitats  compared  with  those  of  northern  Europe  and  North 
America  and  the  relevance  of  criteria  or  guidelines  designed  to  maintain  environmental 
quality  in  overseas  situations,  where  existing  contamination  is  at  much  greater  concen¬ 
trations,  are  additional  considerations  for  risk  management  in  Australasia. 

19.2  Ecological  risk  assessment  as  a  formal  process 

In  Australia,  the  Commonwealth  Environmental  Protection  Agency  (a  national  body) 
and  the  Environmental  Protection  Authorities  of  the  states  of  New  South  Wales  and  Vic¬ 
toria  are  currently  developing  formal  frameworks  for  ERA.  In  addition,  a  number  of  com¬ 
mercial  environmental  consultants  have  also  developed  their  own  frameworks  for  ERA 
{e.g.,  Calvert  and  Baker  1995;  Wenning  etal  1995a).  The  Sydney  Water  Corporation,  in 
New  South  Wales,  now  has  a  statutory  requirement  to  conduct  an  ERA  for  wastewater 
discharges,  and  the  state  environmental  protection  authority  is  required  to  audit  the 
methodology  and  results  (Water  Studies  Centre  1995).  In  New  Zealand,  the  Ministry  of 
Fisheries  is  reviewing  current  practices  of  risk  assessment  and  analysis  in  general,  with 
the  eventual  aim  of  providing  consistency  within  and  among  the  various  agencies  provid¬ 
ing  advice  on  these  matters  to  central  government.  The  Ministry  for  the  Environment  in 
New  Zealand  is  investigating  potential  frameworks  for  ERA,  both  from  a  site-  or  contami- 


SETAC  Press 


353 


354 


ECOLOGICAL  RISK  ASSESSMENT  OF  CONTAMINATED  SEDIMENTS 


nant-specific  perspective  and  as  a  general  framework  to  help  set  priorities  for  environ¬ 
mental  protection  and  management.  As  has  been  the  case  elsewhere,  the  initial  focus  in 
many  aspects  of  risk  assessment  for  aquatic  environments  has  been  on  the  water  column 
rather  than  the  sediments. 

The  New  South  Wales  Environmental  Protection  Authority  has  reviewed  methods  for 
ERA  developed  overseas  as  potential  frameworks  for  use  in  Australia  for  the  development 
of  new  water  quality  guidelines  (Water  Studies  Centre  1995).  It  is  currently  considering 
the  Dutch  method  in  preference  to  those  developed  in  the  United  States  and  Canada 
(Warne  and  Davies  1995).  This  preference  was  based  on  perceived  relative  cost-effective¬ 
ness,  ease  of  implementation,  and  the  identification  of  action  to  be  taken  at  the  end  of  the 
process  of  ERA.  With  respect  to  the  identification  of  action  to  be  taken,  the  Dutch 
method  derives  environmental  quality  criteria  (EQC)  by  extrapolation  from  ecotoxico- 
logical  data,  using  the  method  of  Aldenberg  and  Slob  (1993,  cited  in  Water  Studies  Cen¬ 
tre  1995).  Environmental  quality  criteria  are  defined  as  permitted  concentrations  of 
contaminants  in  the  environment  and  are  derived  as  part  of  the  process  of  risk  assess¬ 
ment.  These  EQC  would  then  be  used  in  the  proposed  Australian  context  as  screening 
levels  to  identify  situations  in  which  further  assessment  is  warranted.  The  method  could 
also  be  used  for  site-specific  assessments.  The  United  States  and  Canadian  methods  do 
not  lead  so  directly  to  the  identification  of  action  to  be  taken,  this  being  the  role  of  the 
subsequent  process  of  risk  management,  which  is  separate  from  the  risk  assessment 
(Norton  et  al  1995). 

Sediment  quality  criteria  and  guidelines  are  currently  used  in  Australia  and  New  Zealand 
on  a  fairly  ad  hoc  basis,  with  no  national  guidelines  or  standards  yet  in  place.  In  this  dis¬ 
cussion,  criteria  are  defined  as  scientific  data  and  relationships  used  to  derive  guidelines 
or  standards.  Guidelines  are  defined  as  numerical  or  narrative  statements  designed  to 
maintain  environmental  quality,  while  standards  are  guidelines  which  have  statutory 
force  to  maintain  environmental  quality.  These  definitions  follow  those  of  Smith  et  al 
(1994).  In  Australia,  the  development  of  sediment  quality  criteria  is,  apparently,  not  con¬ 
sidered  a  high  priority  and  if  guidelines  or  standards  are  eventually  adopted,  they  are 
likely  to  be  derived  largely  from  work  done  in  other  countries  (Pollution  Research  Pty  Ltd 
1994).  There  will,  therefore,  be  an  associated  need  to  assess  the  ability  of  these  guidelines 
to  identify  correctly  toxic  sediments  in  Australian  (or  New  Zealand)  environments.  This 
applies  both  to  the  ability  reliably  to  classify  as  toxic  those  sediments  that  have  a  biologi¬ 
cal  effect  and  as  nontoxic  those  that  do  not.  The  Resource  Management  Act  (1991)  in 
New  Zealand  provides  the  opportunity  for  the  Minister  for  the  Environment  to  recom¬ 
mend  that  standards  be  established  or  set  for  contaminants  and  for  the  quality  of  water 
and  soils,  but  this  has  not  yet  been  used.  To  date,  in  both  Australia  and  New  Zealand, 
criteria  that  have  been  used  and  guidelines  that  have  been  proposed  are  based  on  those 
developed  overseas.  For  example,  a  recent  draft  study  on  the  dumping  of  dredged  spoil 
prepared  for  the  CEPA  in  Australia  (Pollution  Research  Pty  Ltd  1994)  presented  a  pre¬ 
liminary  list  of  screening  concentrations  and  concentrations  above  which  unconfmed 
disposal  at  sea  was  not  considered  suitable  for  a  range  of  organic  and  inorganic  contami- 
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nants.  These  concentrations  were  based  very  largely  on  those  developed  for  the  disposal 
of  dredged  material  in  Florida  (MacDonald  Environmental  Sciences  Ltd  1992).  The  only 
modifications  made  were  for  those  trace  metals  whose  background  concentrations  in 
Australia  were  known  to  be  larger  than  those  used  in  the  study  in  Florida. 

There  is  currently  no  provision  of  overall  objectives  and  policies  for  ERA  by  risk  manag¬ 
ers  in  appropriate  environmental  management  agencies  at  regional  or  national  level  to 
give  relevance,  direction,  and  consistency  to  decisions  made  locally.  In  New  Zealand,  the 
Resource  Management  Act  (1991)  is  designed  to  control  the  effects  of  human  activities, 
rather  than  the  activities  themselves,  which  suggests  that  it  could  provide  a  suitable  con¬ 
text  for  risk  assessment  and  management.  It  specifies  an  overall  objective  of  sustainable 
management  of  natural  resources  by  “...managing  the  use,  development,  and  protection 
of  natural  and  physical  resources  in  a  way,  or  at  a  rate,  which  enables  people  and  commu¬ 
nities  to  provide  for  their  social,  economic,  and  cultural  well-being  and  for  their  health 
and  safety  while...(a)  Sustaining  the  potential  of  natural  and  physical  resources.. .to  meet 
the  reasonably  foreseeable  needs  of  future  generations;  and  (b)  Safeguarding  the  life-sup¬ 
porting  capacity  of  air,  water,  soil  and  ecosystems;  and  (c)  Avoiding,  remedying  or  miti¬ 
gating  any  adverse  effects  on  the  environment.”  The  Act  does  not,  however,  provide  more 
specific  objectives,  such  as  basic  ecological  values  to  be  protected,  or  explicit  guidance  for 
selecting  assessment  endpoints.  In  the  case  of  the  USEPA  guidelines  for  ERA,  Norton  et 
ah  (1995)  suggest  that,  as  a  first  step,  ecological  objectives  and  policies  could  be  devel¬ 
oped  from  existing  statutes,  regulations,  and  administrative  goals  and  decisions,  for  ex¬ 
ample,  the  Endangered  Species  Act  of  1973  (1988).  The  Ministry  for  the  Environment  in 
New  Zealand  is  in  the  process  of  developing  a  national  set  of  measurement  endpoints  to 
monitor  environmental  quality  throughout  the  country,  but  the  development  of  stan¬ 
dards  and  guidelines  is  still  in  a  preliminary  phase  (New  Zealand  Government  1995). 

Contamination  of  the  sediments,  water,  and  biota  of  Lake  Rotorua  and  its  catchment,  in 
the  North  Island  of  New  Zealand,  was  assessed  as  an  initial  case  study  for  the  identifica¬ 
tion  of  environmental  issues  related  to  the  use  of  wood-preservative  chemicals  and  to 
provide  information  for  the  future  development  of  national  guidelines  and  standards  for 
the  management  of  sites  contaminated  with  such  chemicals  (Gifford  etal  1993).  The 
study  assessed  the  risk  to  human  health  from  drinking  water  from  streams  in  the 
catchment  and  from  eating  trout  from  the  lake.  Other  than  comparison  of  concentrations 
of  contaminants  in  the  water  and  sediments  with  overseas  guidelines  for  the  protection 
of  aquatic  life,  an  assessment  of  environmental  risk  was  not  attempted. 

The  Australian  and  New  Zealand  Environment  and  Conservation  Council  and  the  Aus¬ 
tralian  Water  Resources  Council  have  developed  a  strategy  for  the  sustainable  manage¬ 
ment  of  Australia’s  water  resources.  This  includes  the  development  of  water  quality 
guidelines  for  the  protection  of  fresh  and  marine  waters  (Hart  etal.  1993).  These  include 
physical-chemical  and  biological  indicators,  the  latter  being  species  richness  and  compo¬ 
sition,  net  primary  production  and  “ecosystem  function”  (expressed  as  the  ratio  of  pro¬ 
duction  to  respiration).  Guidelines  for  what  constitutes  unacceptable  change  were 
provided  for  each  of  these  endpoints.  In  the  case  of  the  biological  endpoints,  these  guide- 
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lines  appear  to  have  been  intended  largely  to  promote  discussion  of  suitable  values.  In 
New  Zealand,  recommendations  have  recently  been  made  for  the  development  of  water 
quality  guidelines  based  on  ERA  (Water  Studies  Centre  1995).  No  such  guidelines  have 
yet  been  proposed  for  sediments.  In  terms  of  translating  “broad  regulatory  mandates 
into  concrete  risk  assessment  policies  and  objectives”  (Norton  etal  1995),  then,  Australia 
and  New  Zealand  are  at  an  early  stage. 

As  elsewhere,  monitoring  and  verification  of  the  predictions  of  the  risk  characterization 
and  exposure  characterization  are  rare  in  Australia  and  New  Zealand  (Lincoln  Smith 
1991),  but  there  is  an  increasing  awareness  of  their  importance,  at  least  in  principle. 

Uncertainty  over  the  appropriateness  of  using  criteria  and  guidelines  from  other  coun¬ 
tries  has  led  to  a  number  of  challenges  to  environmental  management  decisions  involv¬ 
ing  such  criteria.  For  example,  a  significant  case  in  New  Zealand  in  which  the  disposal  of 
contaminated  sediment  in  the  marine  environment  was  subject  to  a  detailed  assessment 
of  the  ecological  risk  involved  was  in  1990  and  concerned  the  proposed  dumping  of  spoil 
dredged  from  Auckland’s  Waitemata  Harbour  into  the  waters  of  the  nearby  Hauraki 
Gulf.  The  risk  assessment  was  based  on  a  modification  of  the  protocol  described  in  the 
USEPA  (1990)  Draft  Dredged  Material  Testing  Manual.  The  actual  protocol  used,  how¬ 
ever,  deviated  from  that  described  in  a  number  of  ways  (Roper  1991a).  The  first  tier  of  the 
USEPA  protocol  only  allows  dumping  of  material  without  further  testing  if  it  has  been 
obtained  from  a  high-energy  environment  or  a  site  remote  from  all  sources  of  pollution, 
or  if  it  is  to  be  used  for  beach  nourishment.  The  first  tier  of  the  proposed  protocol  in  the 
assessment  of  the  material  from  Waitemata  Harbour,  however,  allowed  dumping  if  the 
concentrations  of  contaminants  in  the  material  were  less  than  the  screening  levels  devel¬ 
oped  by  the  State  of  Washington  for  dumping  of  material  in  Puget  Sound,  Washington 
(Barrick  et  al.  1988).  This  was  despite  the  fact  that  these  levels  were  explicitly  developed 
for  Puget  Sound,  their  uncritical  use  elsewhere  was  advised  against  (Barrick  etal  1988), 
and  the  fact  that  the  USEPA  protocol  stated  that  “...at  present,  chemical  analyses  cannot 
be  used  to  directly  evaluate  the  biological  effects  of  any  contaminants  or  combination  of 
contaminants  in  dredged  material.”  Furthermore,  the  protocol  used  in  the  assessment 
allowed,  potentially,  for  disposal  after  effects  on  the  water  column  alone  had  been  consid¬ 
ered.  In  contrast,  the  protocol  described  by  the  USEPA  states  that  effects  on  the  water 
column  and  the  benthos  must  both  be  considered.  Another  objection  to  the  protocol 
related  to  the  fact  that  dumping  could,  potentially,  be  permitted  without  assessment  of 
bioaccumulation.  Toxicity  tests  for  water  column  effects  were  done  on  only  one  species, 
rather  than  several  as  recommended  in  the  USEPA  protocol.  This  test  used  larvae  of  the 
oyster  Crassostrea  gigas  and  was  criticized  on  the  grounds  that  it  used  an  introduced 
rather  than  an  indigenous  species  and  that,  since  the  species  occurs  in  environments 
known  to  be  contaminated,  the  larvae  are  presumably  relatively  hardy  and,  therefore,  not 
likely  to  give  adequate  indication  of  the  toxicity  of  the  dredged  material  to  other  mem¬ 
bers  of  the  fauna.  For  benthic  toxicity  tests,  the  species  tested  were  the  same  North 
American  species  as  used  in  the  USEPA  protocol.  Tests  using  native  species  of  inverte- 
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brates  were  available  (see  below)  but  were  not  Used.  Again,  the  appropriateness  of  using 
exotic  species  as  surrogates  for  indigenous  ones  was  questioned  by  objectors. 

Some  of  these  procedures  were  subsequently  modified  in  response  to  objections  raised 
during  the  appeal  against  the  initial  granting  of  the  right  to  dump,  though  not  to  the  full 
satisfaction  of  the  objectors  (Roper  1991b). 

A  further  important  aspect  to  this  and  many  other  risk  assessments  in  New  Zealand  and 
Australia  is  the  acceptability  of  the  proposed  activity  to  local  indigenous  people  on  cul¬ 
tural,  traditional,  or  spiritual  grounds.  In  the  case  of  the  proposed  dumping  of  dredged 
spoil  in  the  Hauraki  Gulf,  the  traditional  Maori  guardians  (the  tangata  whenua)  of  the 
waters  concerned  objected  to  the  dumping  on  grounds  that  it  was  incompatible  with 
their  obligation  to  preserve  the  quality  and  integrity  (spiritual  as  well  as  biological)  of 
that  environment.  Although  the  balancing  of  costs  and  benefits  of  the  proposed  action  to 
the  various  stakeholders  is  part  of  the  process  of  risk  management  rather  than  risk  assess¬ 
ment,  the  assessment  of  risk  must  cover  the  interests  of  all  of  the  stakeholders. 

A  later  study  of  environmental  effects  of  the  dumping  of  contaminated  dredged  spoil 
from  Port  Nelson,  New  Zealand,  into  the  adjacent  Tasman  Bay  (Roberts  1992)  showed 
greater  use  of  local  information.  Although  material  from  the  Port  had  been  dumped  at 
the  same  site  for  the  previous  30  years,  the  introduction  of  the  Resource  Management  Act 
(1991)  required  assessment  of  any  impacts  arising  from  the  dumping  of  the  spoil.  In  a 
sediment  quality  triad  approach  (Long  and  Chapman  1985),  the  assessment  incorpo¬ 
rated  measurement  of  concentrations  of  contaminants  in  the  sediments  in  the  dredged 
area  and  at  the  dump  site,  toxicity  testing  of  the  dredged  material  on  a  native  species  of 
amphipod  {Paracorophium  excavatum),  and  comparison  of  the  benthic  faunas  at  the 
dump  site  and  nearby  control  areas.  Concentrations  of  heavy  metals,  PCBs,  and  PAHs  in 
the  dredged  material  were  larger  than  in  sediments  from  Tasman  Bay,  but  there  were  no 
significant  differences  between  concentrations  at  the  dump  site  and  control  locations. 
Sediments  from  the  Port  were  more  toxic  to  P.  excavatum  than  those  from  control  sites, 
but  there  was  no  difference  between  the  dump  site  and  the  controls,  nor  was  there  any 
evidence  that  dumping  had  caused  a  change  in  benthic  communities.  Concentrations  of 
contaminants  in  a  gastropod  mollusc  (Austrofusus  glans),  which  was  the  subject  of  a  trial 
fishery  for  human  consumption,  did  not  differ  significantly  between  the  dump  site  and 
control  sites.  An  elutriate  test  indicated  that  risk  of  release  of  contaminants  from  the  sedi¬ 
ments  to  the  water  column  was  small.  On  the  basis  of  these  results,  it  was  recommended 
that  the  environmental  risks  associated  with  continued  dumping  at  the  site  were  likely  to 
be  smaller  than  (unquantified)  risks  associated  with  changing  to  other  marine  sites  or  to 
dumping  on  land  (assessment  of  alternatives  to  the  proposed  activity  is  a  stipulation  of 
the  Resource  Management  Act).  It  is  perhaps  worth  pointing  out  that,  as  is  generally  the 
case  in  Australasia  and  other  parts  of  the  world,  there  were  no  analyses  of  the  power  of 
the  statistical  tests  of  the  (null)  hypotheses  of  no  difference  between  samples  from  the 
dump  site  and  control  sites.  McBride  et  al  (1993),  McDonald  and  Erickson  (1994),  and 
Erickson  and  McDonald  (1995),  among  others,  have  discussed  the  inappropriateness  of 
using  such  tests  of  difference  in  situations  where  the  consequences  of  Type  II  errors  (/.e.. 
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accepting  the  null  hypothesis  of  no  adverse  effect  when,  in  fact,  there  is  one)  are  likely  to 
cause  environmental  damage.  Alternative  approaches  are  examined  in  the  references 
cited. 

Because  of  the  lack  of  information  about  ecological  effects,  the  process  of  combining  ex¬ 
posure  characterization  and  ecological  response  is  generally  qualitative  rather  than  quan¬ 
titative.  Similarly,  uncertainty  analyses,  when  performed,  are  also  qualitative.  There  have 
been  exceptions  to  this  pattern.  Some  commercial  environmental  consultants,  for  ex¬ 
ample,  have  employed  probabilistic  methods  for  assessing  exposure  to  contaminants 
(Wenning  et  al.  1995b). 

In  the  current  ad  hoc  approach  to  risk  assessment,  the  roles  of  risk  assessor  and  risk  man¬ 
ager  in  New  Zealand  or  Australia  are  not  rigorously  separated,  as  in  the  USEPA  frame¬ 
work.  The  applicant,  the  regulatory  governmental  organization,  and  the  contractor 
(acting  on  behalf  of  either),  where  appropriate,  may  all  take  part  in  the  problem  formu¬ 
lation  phase.  The  applicant,  or  a  contractor  acting  on  their  behalf,  may  collect  the  infor¬ 
mation  on  exposure  and  response.  The  risk  characterization  may  be  done  by  the 
applicant  and/or  contractor  but  there  may  be  discussion  with  the  governmental  organi¬ 
zation.  Often,  the  assessor  may  also  advise  the  client  (who  may,  effectively,  be  the  risk 
manager)  on  methods  of  reducing  impacts.  Alternatively,  the  manager  may  be  a  local, 
regional,  or  national  governmental  organization,  who  operate  in  the  role  through,  for 
example,  the  setting  of  conditions  attached  to  the  granting  of  permission  to  conduct  the 
activity  in  question.  Setting  of  these  conditions  is  not  done  in  a  formal  ERA  framework, 
although  conditions  set  may  be  based  on  previous  applications  or  on  set  procedures. 
Involvement  of  the  risk  assessor  in  the  process  of  risk  management  is  in  contrast  to  the 
strict  separation  of  risk  assessor  and  risk  manager  at  the  risk  management  stage,  pre¬ 
scribed  in  the  USEPA  process  (Norton  et  al  1995).  In  the  USEPA  process,  management 
roles  (during  problem  formulation  and  the  integrative  phase  between  risk  assessment 
and  risk  management)  are  influenced  by  societal  and  scientific  considerations  during  an 
interpretive  stage,  but  the  risk  management  process  itself  is  done  by  the  risk  manager  in 
isolation.  In  the  Dutch  protocol,  as  mentioned  above,  derivation  of  EQC  is  a  direct  out¬ 
come  of  the  assessment  itself 

19.3  Particular  problems  of  sediment  ecological  risk 
assessment  in  Australia  and  New  Zealand 

In  Australia  and  New  Zealand,  methodological  uncertainty  arises  from  lack  of  informa¬ 
tion  on  native  species,  ecosystems  {e.g.,  their  functioning  and  their  inertia,  stability  and 
resilience  sensu  Underwood  1989),  and  existing  stresses.  Existing  stresses  might  include 
chemical,  physical  {e.g.,  changes  in  sedimentation  in  estuaries  and  other  aquatic  habitats 
as  a  result  of  deforestation  and  agricultural  practices  since  the  arrival  of  Europeans),  and 
biological  {e.g.,  interactions  with  introduced  species).  Methods  developed  for  severely 
contaminated  parts  of  the  world  may  be  inappropriate  for  regions  where  there  is  rela¬ 
tively  little  chemical  contamination  (a  risk  management  consideration,  but  one  which 
also  affects  the  choice  of  endpoints,  e.g.,  acute  versus  chronic,  and  hazard  identification). 


SET  AC  Press 


Session  7:  International  Perspectives,  Chapter  1 9 


359 


The  use  of  criteria  from  environments  that  are  already  severely  modified  in  the  Australa¬ 
sian  situation  may  be  similarly  inappropriate.  For  example,  in  the  case  of  the  proposed 
dumping  of  dredged  spoil  in  the  Hauraki  Gulf,  the  objectors  pointed  out  that  the  screen¬ 
ing  levels  cited  in  the  Puget  Sound  guidelines  were  larger  than  concentrations  found  in 
many  of  the  most  polluted  environments  in  New  Zealand.  In  some  of  these  environ¬ 
ments,  however,  ecological  impacts  were  known  to  have  occurred  (as,  for  example,  the 
demonstrated  effects  of  chlordane  in  sediments  at  concentrations  less  than  the  screening 
levels,  discussed  below). 

The  fact  that  risks  to  local  species  are  generally  unknown  leads  to  reliance  on  standards 
or  criteria  derived  from  overseas  studies,  often  without  demonstration  of  measurable 
effects  (since  these  presumed  effects  are  based  on  overseas  species  and  habitats).  This 
takes  such  use  of  criteria  outside  the  area  of  ERA  since  actual  effects  are  not  being  taken 
into  account  (Chapter  1). 

There  is  evidence  that  reliance  on  criteria  derived  from  studies  on  nonnative  species 
(whether  data  are  gathered  overseas  or  in  Australasia  using  exotic  species)  does  not  ad¬ 
equately  assess  risks  to  native  species.  Hickey  (1989)  tested  a  range  of  toxicants  on 
native  New  Zealand  species  and  on  exotic  species.  He  found  that  the  native  cladoceran 
Ceriodaphnia  dubia  was  up  to  four  times  more  sensitive  in  acute  tests  than  Daphnia  ma- 
gna.  Other  differences  in  sensitivity  have  been  found  between  native  New  Zealand  species 
and  those  from  overseas  used  in  standard  tests  (Hickey  and  Vickers  1994).  In  contrast, 
Hickey  and  Roper  (1992)  reported  that  the  sensitivities  of  two  native  species  of  amphi- 
pods  (one  tube-dwelling,  one  burrowing)  to  cadmium  were  similar  to  values  expected  on 
the  basis  of  work  overseas. 

The  development  of  toxicity  tests  for  sediments  using  native  species  from  Australia  and 
New  Zealand  is  in  progress,  although  most  work  to  date  has  focused  on  contaminants  in 
the  water  column.  Tests  on  sediments  include  sublethal,  behavioral  responses  of  marine 
bivalves  (Roper  and  Hickey  1994)  and  effects  on  larval  development  in  sanddollars  (Nip¬ 
per  and  Roper  1995).  Sublethal  tests  on  bivalves  seem  to  be  an  appropriately  sensitive 
and  ecologically  relevant  means  of  assessing  effects.  Hickey  and  Martin  (1995)  compared 
the  relative  sensitivities  of  five  species  of  freshwater,  benthic  invertebrates  (an  amphipod, 
a  clam,  an  oligochaete,  a  tanaid,  and  a  mayfly)  to  reference  toxicants  and  to  bleached 
kraft  mill  sediment.  There  was  a  large  range  in  sensitivity  among  species  and  among  toxi¬ 
cants.  Clam  reburial,  amphipod  survival,  and  oligochaete  reproduction  were  the  most 
sensitive  endpoints  for  detecting  effects  of  contaminants.  A  review  of  work  in  New 
Zealand  is  given  by  Hickey  (1995).  In  Australia,  tests  for  effects  of  sediment  have  been 
developed  with  burrowing  amphipods  and  isopods  and  for  effects  of  pore  water  with 
amphipods  and  larvae  of  oysters  and  prawns  (many  of  these  results  have  not  yet  been 
published).  Ahsanullah  etal.  (1984)  and  Weimin  etal  (1992)  have  examined  bioaccumu¬ 
lation  of  heavy  metals  from  sediments  by  Australian  species  of  molluscs,  polychaetes,  and 
crustaceans.  In  general,  there  does  not  yet  seem  to  have  been  much  work  on  development 
of  tests  of  sediment  toxicity  using  organisms  from  tropical  parts  of  Australia. 
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As  part  of  the  National  Pulp  Mills  Research  Program,  funded  by  the  Commonwealth 
Government  of  Australia,  Moverley  et  al.  (1995)  have  developed  mesocosms  with  marine 
benthic  meiofauna  (animals  with  body  sizes  in  the  range  63  to  500  Tm)  and  examined 
their  potential  use  for  ecotoxicological  testing.  Responses  of  the  fauna  to  perfusion  of  the 
sediments  with  different  concentrations  of  4-chlorophenol  were  detected  using  multivari¬ 
ate  statistical  methods.  These  results  suggested  that  the  mesocosms  were  a  useful  tech¬ 
nique  for  testing  chronic  effects  of  contaminated  sediments  on  meiofaunal  communities. 

Changes  to  the  amount  of  sediment  in  suspension  and  rates  of  deposition  in  streams  and 
estuaries  have  been  widespread  since  the  arrival  of  Europeans.  Assessments  of  the  eco¬ 
logical  effects  of  these  changes  are  an  important  contribution  to  ERA  for  sediments  in 
Australasia.  Quinn  and  Hickey  (1993)  compared  the  composition  of  benthic  faunal  as¬ 
semblages  in  streams  above  and  below  outfalls  of  sewage  effluent.  The  percentage  of  taxa 
whose  abundance  differed  significantly  between  sites  above  and  below  the  outfalls  corre¬ 
lated  inversely  with  the  dilution  of  the  effluent  and  directly  with  the  concentration  of 
suspended  solids.  In  an  equivalent  study  of  assemblages  above  and  below  clay  discharges 
from  alluvial  gold-mining,  Quinn  et  al.  (1992)  found  that  abundances  of  invertebrates 
downstream  of  discharges  as  a  proportion  of  abundances  upstream  were  inversely  corre¬ 
lated  with  the  log  of  turbidity  loadings.  Taxonomic  richness  showed  a  similar  relation¬ 
ship. 

Determination  of  the  ecological  relevance  of  toxic  effects  of  contaminants  observed  in 
laboratory  ecotoxicological  tests  requires  studies  in  the  field.  Surveys  and  comparisons  of 
spatial  and  temporal  variation  in  the  distributions  of  contaminants  and  organisms  pro¬ 
vides  correlative  evidence  of  impacts.  Demonstration  of  cause  and  effect  under  realistic 
conditions  requires  manipulative  field  experiments.  Both  of  these  approaches  have  been 
tried  in  Australia  and  New  Zealand.  There  are  numerous  examples  of  correlative  studies, 
such  as  that  by  Ward  and  Young  (1982),  of  epibenthic  faunas  in  seagrass  beds  near  a  lead 
smelter  in  South  Australia.  In  New  Zealand,  Roper  etal.  (1989)  described  the  distribution 
of  benthic  organisms  around  coastal  outfalls  in  the  towns  of  Gisborne  and  Hastings,  and 
Roper  etal  (1988)  described  apparent  effects  of  diffuse  contamination  derived  from  ur¬ 
ban  runoff  on  benthic  communities  in  the  Manukau  Harbour  near  Auckland. 

In  a  manipulative  field  experiment,  Pridmore  et  al  (1992)  experimentally  contaminated 
the  surface  sediments  of  a  large  area  (300  m^)  of  muddy  sandflat  with  technical  chlor- 
dane.  The  contaminant  was  applied  by  soaking  sediment  in  the  laboratory  in  a  volatile 
solvent  in  which  chlordane  had  been  dissolved,  drying  the  sediment,  and  spreading  it  on 
the  surface  of  the  sandflat  in  an  area  without  existing  chlordane  contamination.  This 
treatment  resulted  in  a  reduced  abundance  of  two  species  of  bivalves  compared  to  a 
nearby  control  area.  The  concentration  of  chlordane  in  near-surface  sediments  at  the 
experimental  site  during  the  period  when  numbers  of  bivalves  showed  the  largest 
changes  was  6.7  to  8.7ng.(g  dry  fines)  ^  This  concentration  was  similar  to  those  in  con¬ 
taminated  areas  of  the  same  harbor  where  the  experiment  was  done  and  to  those  re¬ 
ported  from  overseas  studies  of  chlordane  contamination. 
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To  examine  the  effects  of  copper  contamination  on  the  fauna  of  sandy  subtidal  sediments 
in  Botany  Bay,  near  Sydney,  Australia,  Morrisey  et  al  (1995  1996)  experimentally  en¬ 
hanced  the  concentration  of  copper  in  the  sediment  by  burying  blocks  of  building  plas¬ 
ter  impregnated  with  copper  sulphate.  This  treatment  caused  a  change  in  the  abundance 
of  a  number  of  types  of  animals  over  a  period  of  six  months,  relative  to  control  treatments 
(unmanipulated  areas  and  areas  in  which  blocks  of  plaster  without  copper  had  been  bur¬ 
ied). 

The  general  absence  of  severe  historical  contamination  from  anthropogenic  sources  in 
aquatic  environments  in  Australia  and  New  Zealand  provides  opportunities  for  studying 
the  effects  of  diffuse  forms  of  contamination.  In  New  Zealand,  attention  has  recently  fo¬ 
cused  on  diffuse  contamination  from  urban  runoff.  An  ongoing  study  by  the  National 
Institute  of  Water  and  Atmospheric  Research  Ltd  in  New  Zealand  is  combining  partly 
conceptual/partly  quantitative  modeling  of  the  accumulation  of  contaminants  derived 
from  runoff  in  the  sediments  of  urbanized  estuaries,  correlative  field  studies  of  the  eco¬ 
logical  impacts  of  these  contaminants,  in  terms  of  patterns  of  abundance  and  distribu¬ 
tion  of  contaminants  and  sediment-living  animals,  and  laboratory-based  studies  of  the 
toxicity  of  sediments  from  these  estuaries  to  native  species. 

19.4  The  appropriateness  of  a  formal  framework  for 
ecological  risk  assessment 

How  appropriate  is  a  formal  framework  in  situations  where  there  is  little  information  on 
local  conditions  (type  of  habitat,  existing  concentrations  of  contaminants,  functions  of 
ecosystems),  species,  or  environments? 

1)  The  advantage  in  such  a  situation  is  that,  in  theory,  a  formal  framework  reduces 
the  tendency  for  ad  hoc,  inconsistent  decisions  from  one  case  to  another.  Such 
inconsistency  arises  from  variations  in  experience  of  the  risk  assessors  and 
managers  and  variation  in  the  amount  of  information  available  on  which  to 
base  risk  assessments  and  decisions  relating  to  risk  management.  Promotion  of 
“consistency  across  EPA  assessment”  is  one  of  the  reasons  behind  the  develop¬ 
ment  of  a  framework  for  ERA  by  the  USEPA  (Norton  et  al  1995).  Even  in  a 
relatively  well-developed  framework,  such  as  that  of  the  USEPA,  as  discussed  in 
Chapter  1,  there  is  much  variation  in  actual  practice  of  ERA  (relative  to  other 
forms  of  risk  assessment)  because  of  the  early  stage  of  development  of  the 
process  and  the  lack  of  guidelines  from  appropriate  regulatory  agencies. 

2)  Provision  of  such  a  framework,  however,  may  itself  lead  to  uncritical  applica¬ 
tions  of  inappropriate  methods  by  the  same  assessors  or  managers  seeking  easy 
solutions  to  their  dilemmas.  Derivation  of  EQC,  for  example,  may  be  valid  in  a 
northern  European  context  where  there  is  much  information  on  the  response  of 
organisms  to  contaminants  or  other  stressors  and  on  risks  of  exposure,  and 
where  assumptions  used  in  deriving  criteria  or  guidelines  are  most  likely  to  be 
reasonable.  In  Australasia,  however,  derivation  of  criteria  or  guidelines  based  on 
assumptions  developed  elsewhere  may  result  in  inadequate  characterization 


SET  AC  Press 


362 


ECOLOGICAL  RISK  ASSESSMENT  OF  CONTAMINATED  SEDIMENTS 


and  assessment  of  ecological  risk.  The  use  of  expert  knowledge  of  a  particular 
system  and  the  relevant  risks  might  be  particularly  appropriate  in  situations 
where  the  validity  of  the  assumptions  associated  with  a  formal  framework  is 
poorly  known.  Expert  opinion  is  an  important  part  of  the  problem  formulation 
stage  of  ERA,  but  it  is  also  relevant  at  other  stages  of  the  process.  Guidelines 
being  developed  for  ERA  by  the  USEPA  (Norton  et  al  1995)  “will  not  be  rigid 
and  will  encourage  the  use  of  professional  judgment  within  the  construct  of  a 
logical  and  scientifically  sound  structure.”  In  fact,  the  proposed  framework  for 
ERA  in  New  South  Wales  stresses  the  importance  of  site-specific  considerations 
and  therefore,  implicitly,  of  local,  expert  judgment  (Warne  and  Davies  1995). 

Balanced  against  the  risk  of  misapplication  of  methods  and  standards  developed  overseas 
is  the  urgent  need  for  some  form  of  protocol  for  ERA,  since  environmental  management 
decisions  have  to  be  made  now,  regardless  of  how  inadequate  the  information  underlying 
them  may  be.  The  use  of  methods  of  risk  assessment,  criteria,  and  guidelines  for  environ¬ 
mental  quality,  such  as  SQC,  developed  overseas  may  represent  the  best  available  synthe¬ 
ses  of  current  knowledge,  however  incomplete. 

19.5  Summary 

It  is  unlikely  that  either  Australia  or  New  Zealand  will  be  able  to  develop  their  own,  inde¬ 
pendent  methods  of  risk  assessment.  Which  of  the  various  models  available  overseas  is 
eventually  adopted,  and  how  and  to  what  extent  resources  allow  these  to  be  modified  for 
local  use,  remains  to  be  seen.  In  the  meantime,  an  important  contribution  to  future  de¬ 
velopment  of  formal  methods  of  risk  assessment  could  be  made  by  monitoring  and  re¬ 
porting  of  the  success  or  otherwise  of  the  various  ad  hoc  approaches  currently  being  used, 
so  that  the  information  gained  from  such  experience  can  guide  later  developments.  De¬ 
spite  numerous  pleas  {e.g.,  Hilborn  and  Walters  1981)  for  programs  of  monitoring  after 
developments  or  other  forms  of  impact  have  occurred,  assessment  of  impacts  and  the 
accuracy  of  assessment  of  risk  still  frequently  end  with  the  awarding  of  a  consent. 
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20.1  Introduction 

This  section  gives  a  brief  overview  of  the  status  of  SERA  in  Europe  in  terms  of  the  three 
applications:  product  assessment,  site  cleanup,  and  dredged  material  disposal.  In  gen¬ 
eral,  formal  Europe-wide  guidance  is  limited,  although  the  need  for  consideration  of  sedi¬ 
ment  exposure  and  effects  is  now  being  incorporated  into  European  legislation,  and 
many  research  and  development  (R&D)  programs  are  being  funded  by  international 
bodies,  to  develop  and  standardize  the  tools  required.  Also,  discussion  of  triggering  and 
assessment  frameworks  is  underway  in  several  areas. 

20.2  Product  assessment 

20.2.1  Current  legislation 

In  recent  years,  regulatory  authorities  in  Europe  have  started  to  raise  concerns  about  the 
potential  toxicity  to  benthic  organisms  of  pesticides  and  other  organic  chemicals  which 
accumulate  in  sediments.  These  concerns  are  now  beginning  to  be  implemented  into  leg¬ 
islation  for  pesticides,  e.g.,  the  new  European  Pesticide  Directive  (CEC 1991)  which  in¬ 
cludes  a  sediment  testing  requirement  for  certain  types  of  compound.  In  the  general 
chemicals  area,  there  is  as  yet  no  firm  regulatory  requirement  for  sediment  toxicity  test¬ 
ing,  and  so  this  brief  review  of  current  status  principally  reflects  product  assessment  for 
pesticides,  although  clearly  there  are  common  themes. 

20.2.2  Testing  methods 

Concerns  over  sediment  toxicity  have  resulted  in  a  great  deal  of  activity  in  the  develop¬ 
ment  of  testing  methods  over  recent  years  and  much  of  the  European  experience  is  cap¬ 
tured  in  the  comprehensive  proceedings  of  an  earlier  SETAC-Europe  workshop  held  in 
the  Netherlands  (Hill  etal  1994).  For  freshwater  assessment,  the  most  commonly  used 
organism  in  Europe  has  been  the  freshwater  dipteran,  Chironomus  riparius.  This  has  most 
likely  been  due  to  its  robustness  and  amenability  to  ecologically  relevant  endpoints  like 
emergence,  for  which  it  is  possible  to  develop  ECx  measures  of  response,  rather  than  re¬ 
liance  on  NOECs.  The  useMness  of  the  latter  has  been  a  matter  of  debate  in  Europe  over 
recent  years,  at  least  partly  because  of  their  dependence  on  experimental  design. 

In  June  1995,  a  meeting  of  international  experts  was  organized  by  the  OECD  in  Copen¬ 
hagen,  Denmark  to  discuss  the  prioritization  of  benthic  testing  methods  (OECD  1995). 
The  recommendation  of  the  group  was  that  Chironomus  spp.  should  be  the  first  priority 
for  the  development  of  a  standardized  OECD  sediment  toxicity  method  (as  would  be 
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used  for  product  assessment).  It  therefore  seems  likely  that  this  genus  will  become  the 
principal  sediment  testing  invertebrate  for  products  in  Europe,  much  as  Daphnia  spp. 
has  been  used  as  the  representative  organism  for  water  column  or  pelagic  invertebrates. 
The  second  priority  recommendation  for  a  standardized  OECD  guideline  was  estuarine 
and  marine  amphipods.  In  both  cases,  protocols  that  have  already  been  developed  by 
other  international  bodies  will  be  used  as  a  basis  for  development  of  definitive  guidelines. 
For  example,  available  European  guidelines  include  acute  and  chronic  toxicity  tests  with 
Chironomus  riparius  and  Corophium  standardized  for  the  European  Commission 

(Fleming  et  al  1994),  an  acute  Corophium  volutator  protocol  for  evaluation  of  drilling 
muds  and  chemicals  used  in  offshore  operations  (Paris  Commission  [PARCOM]  1994), 
and  a  Chironomus  riparius  protocol  for  measuring  the  toxicity  of  products  applied  to  the 
water  column  of  a  sediment  system,  in  order  to  simulate  an  overspray  scenario 
(Biologische  Bundesanstalt  Fiir  Land-Und  Forstwirtschaft  [BBA/IVA]  1994).  It  is  in¬ 
tended  that  acute  and  chronic  methods  will  be  incorporated  into  the  same  generic  OECD 
guideline  and  a  list  of  appropriate  species  will  be  recommended.  The  use  of  artificial  sedi¬ 
ments,  when  fully  developed  and  standardized,  will  also  be  considered. 

20.2.3  Triggers 

With  testing  methods  gradually  becoming  established,  more  recently  efforts  have  begun 
to  focus  on  developing  suitable  procedures  for  triggering  sediment  toxicity  assessment. 
In  common  with  the  approaches  described  in  Chapter  4,  the  broad  view  in  Europe  is  that 
prospective  product  testing  should  be  based  on  the  propensity  of  the  chemical  to  adsorb 
and  persist  and  be  potentially  toxic.  Indeed  this  view  is  reflected  in  the  European  Pesti¬ 
cide  Directive  (91/414/EC),  which  states: 

Where  environmental  fate  and  behaviour  data... report  that  an  active  substance 
is  likely  to  partition  to  and  persist  in  aquatic  sediments,  expert  judgement 
should  be  used  to  decide  whether  an  acute  or  a  chronic  sediment  toxicity  test  is 
required.  Such  expert  judgement  should  take  into  account  whether  effects  on 
sediment  dwelling  invertebrates  are  likely.... 

Currently,  the  European  Crop  Protection  Association  (ECPA)  and  the  American  Crop  Pro¬ 
tection  Association  (ACPA)  are  together  developing  proposals  for  measures  of  these  ad¬ 
sorption,  persistence  and  toxicity  triggers.  Suitable  measures  are  under  discussion,  and 
proposals  include  the  use  of  soil  aerobic  degradation  rates  as  an  indication  of  persistence, 
OC  partitioning  coefficients  (Kq^)  to  assess  adsorption,  and  data  from  Daphnia  acute  or 
chronic  toxicity  tests  to  estimate  toxicity  (Table  20-1).  The  advantages  of  using  these  end¬ 
points  for  triggering  sediment  studies  are  that  the  data  are  generated  as  a  matter  of 
course  for  the  vast  majority  of  pesticides  and  that,  with  suitably  selected  values,  they  are 
likely  to  provide  conservative  estimates  of  persistence,  adsorption,  and  toxicity  in  sedi¬ 
ment  (Hamer  et  al  1995).  As  with  many  triggers,  more  realistic  data  (with  less  uncer¬ 
tainty)  can  be  used  to  supersede  these  preliminary  triggers  if  appropriate.  The  values 
proposed  for  each  of  these  measures  are  currently  being  evaluated  from  company  data- 
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Table  20-1  ECPA/ACPA  proposed  values  for  triggering  of  sediment  toxicity  tests 


Conditions  for  sediment  toxicity  testing 

Exceptions 

If  preliminary  risk  assessments  suggest 
potential  for  impacts  on  aquatic 
invertebrates,  then  sediment  toxicity 
assessment  is  required 

1)  if  there  is  potential  for  partitioning  to 
and  persistence  in  sediments  such  that 

Kq^.  (OC  partition  coefficient)  ^  1000 
from  a  standard  batch  equilibrium  study 

and 

2)  DTgQ  (degradation  half  life)  ^  30  days 
in  a  soil  aerobic  degradation  laboratory 
study 

unless  it  can  be  demonstrated  that  the  DT^q  <  30  days  in  the 
aquatic  environment  (by,  e.g. ,  hydrolytic,  photochemical, 
ready  biodegradation,  water-sediment  metabolism  of  other 
appropriate  studies)  or  the  concentration  of  the  parent 
compound  ^  10%  in  sediment  after  more  than  14  days  in  a 
water-sediment  metabolism  system 

and 

3)  if  there  is  potential  for  toxicity  such 
that 

Daphnia  48-h  EC^q  <  1.0  mg/l  or  21-d 
NOEC  <0.1  mg/1 

unless  by  means  of  a  suitable  risk  assessment  that  takes  into 
account  of  partitioning  of  compound  between  water  and 
sediment,  it  can  be  shown  that  there  is  no  potential  risk  to 
invertebrates,  eg ,  the  ratios  between  the  acute  or  chronic 
effects  and  exposure  concentration  are  >10  or  >1, 
respectively. 

bases  of  regulatory  studies,  and  a  publication  of  the  findings  is  planned  for  the  near  fu¬ 
ture. 

Triggers  have  also  been  proposed  by  the  German  BBA  (Streloke  and  Kopp  1995).  These 
also  include  estimates  of  adsorption  and  persistence  from  the  BBA  sediment  metabolism 
study  (BBA  1990)  and  toxicity,  such  that  a  sediment  toxicity  test,  as  described  above,  is 
required  when  “the  active  ingredients,  toxic  metabolites  or  bound  residues  are  found  in 
the  sediment  [from  the  sediment  metabolism  study]  in  an  amount  higher  than  10%  of  the 
applied  test  substance  after  day  14.  Furthermore,  the  NOEC  from  the  chronic  toxicity  test 
with  Daphnia  should  be  lower  than  O.lmg/1  or  the  BCF  in  fish  higher  than  100.” 

20.2.4  Risk  assessment 

In  addition  to  developing  triggers  and  methods  for  sediment  toxicity  assessment,  there 
have  also  been  some  preliminary  discussions  of  how  sediment  data  will  be  used  in  a  risk 
assessment.  The  general  principle  of  risk  assessment  always  involves  the  comparison  of 
an  effect  concentration  to  a  PEC.  Although  methods  for  estimating  concentrations  in 
surface  water  are  well-established,  until  recently  few  models  have  included  PECs  in  sedi- 
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merit.  One  exception  is  TOXSWA,  a  new  model  under  development  in  the  Netherlands 
by  the  Winand  Staring  Centre  (SC-DLO;  Adriaanse  1995).  Furthermore,  the  develop¬ 
ment  of  sediment  PECs  has  also  been  included  in  the  remit  of  the  Forum  for  the  Coordi¬ 
nation  of  Pesticide  Fate  Models  and  their  Uses  (FOCUS),  so  development  of  a  hannonized 
approach  is  expected  in  the  near  future. 

20.3  Site  cleanup 

20.3.1  Current  legislation 

Until  recently,  philosophy  in  European  legislation  has  been  that  maintenance  of  accept¬ 
able  water  quality  will  automatically  lead  to  acceptable  sediment  quality.  Mention  of 
sediments  in  water  legislation  has  generally  been  restricted  to  emission  control  directives 
such  as  the  Dangerous  Substances  Directive  (76/464/EEC).  This  directive  provides  a 
framework  for  the  elimination  or  reduction  of  pollution  of  inland,  coastal,  and  territorial 
waters  by  dangerous  substances.  The  most  dangerous,  decided  on  the  basis  of  toxicity, 
bioaccumulation,  and  persistence  (List  1  or  Black  List)  should  be  eliminated;  the  less 
harmful  substances  (List  2  or  Grey  List)  must  be  reduced.  List  1  substances  are  controlled 
at  a  European  Union  level  by  means  of  daughter  directives  which  lay  down  standards  for 
specific  substances,  and  List  2  substances  are  controlled  at  a  national  level.  Under  the 
terms  of  the  Directive,  Member  States  may  adopt  either  Limit  Values  based  on  uniform 
fixed  emissions  which  are  independent  of  the  use  of  receiving  water,  or  Environmental 
Quality  Standards  for  the  receiving  water,  which  are  dependent  on  the  intended  use.  In 
the  absence  of  generic  sediment  quality  standards  for  receiving  water,  or  limit  values  for 
emissions  that  have  taken  into  account  the  potential  of  a  substance  to  bind  to  sediment, 
the  stand  still  principle”  is  often  adopted  whereby  monitoring  for  compliance  requires 
that  levels  of  dangerous  substances  in  sediments  do  not  increase. 

At  present,  there  is  a  move  within  Europe  toward  a  more  integrated  approach  to  water 
management,  focusing  on  protecting  the  aquatic  ecosystem  and  water  uses  as  a  whole. 
Changes  in  water  legislation  are  being  brought  about  by  the  development  of  a  new  Euro¬ 
pean  Commission  Framework  Water  Resource  Directive,  which  is  intended  to  form  the 
basis  of  all  future  water  policy.  This  is  likely  to  incorporate  some  of  the  existing  directives 
covering  the  protection  of  receiving  waters.  The  Framework  Directive  is  aimed  at  protect¬ 
ing  ecosystems  from  point-  and  diffuse-source  pollution  and  other  anthropogenic  influ¬ 
ences,  will  establish  links  between  quantity  and  quality  of  water,  and  will  apply  to  both 
surface  water  and  groundwater. 

The  water  quality  aspects  of  this  Directive  will  be  based  on  the  recent  European 
Commission  s  proposed  Directive  on  the  Ecological  Quality  of  Water  (COM(94)680), 
which  lays  out  a  system  for  global  estimation  and  classification  of  the  ecological  quality 
of  surface  waters.  The  measures  proposed  are  intended  to  maintain  and  improve  the  eco¬ 
logical  quality  of  waters  with  the  ultimate  aim  of  achieving  “good  ecological  quality.”  Eco¬ 
logical  quality  is  defined  by  nine  quality  elements  including  biological,  chemical,  and 
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physical  measures  or  indices.  Two  quality  elements  are  relevant  to  management  of  con¬ 
taminated  sediments: 

1)  Levels  of  toxic  and  other  harmful  substances  in  water,  sediment,  and  biota 

2)  Structure  and  quality  of  the  sediment  and  its  ability  to  sustain  the  biological 
community  of  the  ecosystem 

For  both  of  these  quality  indicators,  assessment  and  measurement  endpoints  are  cur¬ 
rently  being  investigated  by  the  European  Commission.  Should  the  Directive  be  adopted. 
Member  States  will  select  parameters  that  represent  the  most  sensitive  indicators  of  eco¬ 
logical  quality  of  the  waters  concerned.  This  approach  will  provide  guidance  on  global 
classification  of  sediment  quality,  and  it  will  allow  resources  to  be  directed  to  those  sites 
at  which  further  investigation  is  required  to  determine  whether  remedial  action  is  neces¬ 
sary,  i.e.,  site-specific  SERA. 

20.3.2  Testing  methods 

At  present,  there  is  little  formal  guidance  within  Europe  for  carrying  out  SERAs  to  help 
decision-making  in  terms  of  remediation  options  and  habitat  rehabilitation.  Site-specific 
investigations  are  occasionally  being  used,  but  the  chosen  methodologies  are  fragmen¬ 
tary  and  are  in  need  of  standardization  and  harmonization  within  and  between  Member 
States.  This  fragmentation  is  often  caused  by  different  organizations  being  contracted  to 
undertake  the  risk  assessment  and,  on  each  occasion,  methods  being  selected  on  the 
basis  of  in-house  expertise.  In  addition,  procedures  and  policies  differ  between  Member 
States,  with  different  degrees  of  importance  being  placed  on  the  role  of  ERA  for  sedi¬ 
ments.  However,  in  several  States,  methods  and  guidelines  are  being  proposed,  generally 
in  accordance  with  the  guidance  given  in  Chapter  7.  The  Netherlands  and  the  UK  are 
used  here  as  two  case  examples. 

In  the  Netherlands,  site  cleanup  decisions  are  based  largely  on  national  SQC.  National 
target  values,  limit  values,  and  intervention  values  have  been  derived  using  aquatic  tox¬ 
icity  test  data  and  the  limited  spiked-sediment  toxicity  test  data  available  (Dutch  Minis¬ 
try  of  Housing,  Physical  Planning  &  Environment  [VROM]  1994).  The  former  are 
translated  into  sediment  data  using  EqP  theory,  and  all  criteria  are  standardized  for  OC 
and  particle  size  distribution.  Short-term  decisions  on  remediation  of  highly  contami¬ 
nated  sites  are  based  on  exceedance  of  the  intervention  value  {ix.,  the  hazard  concentra¬ 
tion  [HC50],  which  theoretically  protects  50%  of  all  species).  However,  it  is  recognized 
that  these  guidelines,  because  of  their  generic  nature,  should  lead  to  a  site-specific  inves¬ 
tigation  before  any  further  decisions  are  made.  A  second-level  assessment  has  been  pro¬ 
posed  which  incorporates  chemical  analysis,  benthic  surveys  (based  primarily  on 
chironomid  and  oligochaete  abundance  and  diversity),  and  chronic  bioassays  (Chirono- 
mus  riparius  and  Daphnia  magna).  In  the  risk  analysis  phase,  emphasis  is  placed  more  on 
causal  relationships  between  the  contaminants  identified  and  effects  observed  in  the 
field,  rather  than  weighting  and  aggregating  the  data.  To  date,  only  one  large-scale  reme¬ 
diation  has  been  carried  out  in  the  Netherlands  based  on  this  type  of  assessment  (van  de 
Gucthe  1995). 
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In  the  UK,  cleanup  of  contaminated  sediments  is  less  common  than  in  the  Netherlands, 
where  the  negative  effects  of  sediment  contamination  in  some  areas  of  the  Rhine  and 
Meuse  deltas  have  long  been  proven  (van  de  Gucthe  1995).  UK  sediment  cleanup  opera¬ 
tions  are  driven  more  by  chemical-specific  pollution  incidents  and  are  dealt  with  on  an  ad 
hoc  basis  by  operational  regulatory  staff  in  the  region  affected.  Assessments  may  include 
acute  and  chronic  bioassays,  surveys  of  fish  and  bird  populations,  and  various  modeling 
approaches,  depending  on  the  nature  of  the  contaminant  involved.  However,  a  number 
of  recent  incidents  has  prompted  the  drafting  of  formal  guidelines  for  SERA,  which  are 
expected  to  be  completed  in  1996. 

Unlike  the  Netherlands,  the  UK  has  a  wide  range  of  sediment  geochemistry  and,  as  yet, 
has  no  national  sediment  quality  standards  nor  a  robust  basis  on  which  to  calculate  val¬ 
ues.  Therefore,  the  guidance  will  take  the  form  of  a  tiered  approach  as  outlined  in  Chap¬ 
ter  7,  integrating  toxicity  testing  and  benthic  community  surveys  with  measurement  of 
contaminant  levels  in  sediments  and  biota.  Effects-based  guidelines,  developed  by  other 
national  bodies,  may  be  used  in  initial  tiers. 

For  the  toxicity  testing  component,  recommended  laboratory  bioassays  have  been 
adapted  from  toxicity  test  procedures  developed  for  product  assessment,  primarily  using 
Chironomus  ripavius,  CoTophium  volutator,  and  Arenicola  marina.  Emphasis  will  be  placed 
on  the  further  development  and  use  of  chronic  bioassays  for  which  procedures  are  cur¬ 
rently  being  standardized,  and  on  in  situ  deployments  for  measurement  of  toxicity  and 
bioaccumulation  using  species  such  as  Chironomus  riparius,  Gammarus pulex,  and  Mytilus 
edulis.  For  the  biological  component,  routinely  used  procedures  such  as  the  BMWP  score 
(Biological  Monitoring  Working  Party  1978)  and  the  RIVPACS  model  (Wright  et  al 
1993)  will  continue  to  be  used  for  assessing  biological  quality  of  freshwater  systems, 
while  techniques  for  estuaries  and  coastal  waters  are  currently  under  development  and 
are  due  to  be  tested  on  a  large  scale  in  1997. 

20.4  Dredged  material  disposal 

20.4.1  Current  legislation 

In  different  European  Member  States,  dredging  operations  and  the  disposal  of  dredged 
material,  whether  to  sea  or  land,  are  subject  to  a  range  of  statutory  and  other  controls. 
Disposal  of  dredged  material  to  sea  falls  under  the  remit  of  the  London  Convention 
(1972),  which  has  now  been  ratified  by  more  than  70  countries  and  controls  the  dump¬ 
ing  of  “noxious  substances,”  including  chemical  waste,  into  the  oceans.  Other  local  con¬ 
ventions  are  also  in  place,  such  as  the  OSPAR  Convention  (OSPAR  1992;  derived  from 
Oslo  and  Paris  Conventions),  which  is  aimed  at  protecting  the  northeast  Atlantic  and  in¬ 
cluding  the  North  Sea  from  pollution  by  dumping  by  ships  and  aircrafts  and  land-based 
sources,  and  applies  to  all  countries  which  border  on  these  areas  or  have  rivers  that  dis¬ 
charge  into  them.  Many  Member  States  have  developed  their  own  control  procedures  for 
dredging  and  disposal,  to  fit  in  with  the  relevant  international  conventions.  Control  is 
carried  out  by  licensing  authorities  within  each  State. 
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Assessing  the  effects  of  dredged  material  disposal  requires  consideration  of  both  the  dis¬ 
posal  site  and  the  material  to  be  dredged.  Some  guidance  on  the  criteria  to  be  considered 
in  the  licensing  process  is  given  in  the  international  conventions.  The  London  Conven¬ 
tion  is  recently  undergoing  substantial  revision,  and  an  assessment  procedure  for 
dredged  material  disposal  licensing  was  adopted  in  December  1995  (the  Dredged  Mate¬ 
rial  Assessment  Framework  [London  Convention  1995]).  For  the  disposal  site,  consider¬ 
ation  of  the  location  must  be  taken  into  account,  and  the  effects  on  navigation,  coast 
protection,  fishing  amenity,  nature  conservation,  and  other  legitimate  uses  of  the  sea 
considered.  Disposal  in  sensitive  areas  such  as  live  coral  reefs  and  other  areas  adapted  to 
low  turbidity  is  not  recommended.  For  the  dredged  material  itself,  the  international  con¬ 
ventions  give  general  guidance  on  physical  and  chemical  considerations.  The  Oslo  Com¬ 
mission  Guidelines  for  the  Management  of  Dredged  Material  (OSPAR  1993)  give 
guidance  on  the  assessment  of  dredged  material  in  terms  of  characteristics,  composition, 
sampling,  and  analysis  for  both  management  of  dredged  material  and  subsequent  moni¬ 
toring  programs.  In  these  guidelines,  the  dredged  material  is  exempt  from  testing  if  it  has 
not  been  exposed  to  appreciable  sources  of  contamination  and  satisfies  the  following 
criteria: 

1)  It  is  composed  almost  exclusively  of  sand,  gravel,  or  rock. 

2)  It  is  for  beach  nourishment  or  restoration. 

3)  The  amount  is  less  than  10,000t/year  and  there  is  existing  information  on 
sediment  quality. 

Further  chemical  information  is  required  for  those  dredged  materials  which  do  not  meet 
these  exemptions  to  establish  whether  disposal  may  cause  undesirable  effects  on  marine 
organisms  or  human  health.  Attention  is  paid  to  bioaccumulation  in  marine  organisms, 
particularly  in  food  species.  If  substances  are  present  for  which  biological  effects  are  not 
fully  understood,  the  use  of  biological  test  procedures  is  recommended.  These  may  in¬ 
clude  laboratory  tests  for  acute  toxicity,  chronic  toxicity,  and  bioaccumulation. 

20.4.2  Testing  methods 

Some  detail  of  procedures  in  the  Netherlands  is  given  in  Chapter  5.  This  section  will  re¬ 
fer  primarily  to  procedures  adopted  in  the  UK.  In  the  UK,  the  majority  of  dredged  mate¬ 
rial,  particularly  from  estuaries,  ports,  and  harbors  for  navigation  purposes,  is  replaced 
in  the  sea  either  for  disposal  or  beneficial  use.  The  main  UK  legislation  for  sea  disposal  is 
the  Food  and  Environment  Protection  Act  (1985),  and  licensing  under  this  legislation  is 
controlled  by  the  Ministry  of  Agriculture,  Fisheries  and  Food  (MAFF)  in  England,  the 
Scottish  Office  Agriculture  and  Fisheries  Department  (SOAEFD)  in  Scotland,  and  the 
Department  of  the  Environment  in  Northern  Ireland  (DoENI).  Prior  to  dredging,  an  as¬ 
sessment  is  carried  out  for  the  licensing  authority,  which  takes  into  account  the  charac¬ 
teristics  of  the  material  to  be  dredged  and  also  the  characteristics  of  the  deposit  site. 

The  characteristics  of  the  dredged  material  that  are  required  for  licensing  are  quantity, 
sediment  type,  and  chemical  nature.  For  the  majority  of  sediments,  the  determinands 
measured  are  those  defined  in  the  technical  supplement  to  the  Oslo  Convention  (OS- 
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CINE)  guidelines.  In  England  and  Wales,  samples  are  routinely  analyzed  for  metals,  Le., 
mercury,  cadmium,  lead,  zinc,  copper,  nickel,  chromium,  and  sometimes  arsenic.  Most 
sediments  are  also  analyzed  for  organotin  compounds.  Analyses  for  PCBs  and  orga- 
nochlorine  pesticides,  petroleum  hydrocarbons,  and  other  determinands  are  carried  out 
as  appropriate  in  particular  situations  where  contamination  is  suspected,  or  where  back¬ 
ground  information  on  the  levels  of  these  contaminants  in  UK  dredged  material  is  re¬ 
quired.  The  use  of  sediment  bioassays  in  the  licensing  process  is  currently  being 
investigated,  although  the  way  in  which  these  tests  will  be  incorporated  is  yet  to  be  deter¬ 
mined.  A  number  of  surveys  have  been  conducted  to  assess  biological  effects  of  dredged 
sediments  using  a  battery  of  acute  and  chronic  bioassays  including  Corophium  volutator 
and  Arenicola  marina. 

Monitoring  the  effects  of  disposal  at  sea  is  also  a  requirement  of  the  international  conven¬ 
tions  and  is  carried  out  by  the  licensing  authorities  in  the  UK.  It  allows  the  authorities  to 
confirm  the  predicted  impacts  of  disposal  and  to  take  early  action  if  unexpected  impacts 
occur,  and  it  provides  a  basis  for  future  licensing  policy.  Monitoring  investigations  take 
place  at  a  number  of  disposal  sites  and  include  these  aspects: 

1)  Sampling  and  analysis  (physical  and  chemical)  of  seabed  sediments 

2)  Recording  of  seabed  topography,  using  sidescan  sonar  and  photography 

3)  Measurement  of  water  currents 

4)  Benthic  community  assessment 

5)  Chemical  analysis  of  fish  and  shellfish 

In  1992,  a  group  was  set  up  by  MAEF  to  develop  detailed  guidelines  for  monitoring  at 
sewage-sludge  and  dredged  material  disposal  sites  (Group  Co-ordinating  Sea  Disposal 
Monitoring  [GCSDM]).  One  of  the  outputs  of  the  group  was  a  list  of  “Sediment  Action 
Limits”  (levels  which,  if  exceeded,  indicate  that  studies  should  be  initiated  to  investigate 
the  availability  of  the  contaminant  to  organisms;  MAEF  1994).  These  were  derived  using 
the  EqP  approach  for  a  number  of  substances  on  UK  priority  hazardous  substances  lists. 
Field  studies  are  now  being  conducted  to  assess  the  validity  of  these  guidelines.  Other 
recommendations  made  by  the  GCSDM  were  that  chemical  monitoring  of  disposal  sites 
should  be  combined  with  measures  of  benthic  community  structure  and  sediment  toxic¬ 
ity.  The  tools  with  which  to  carry  out  such  investigations  are  currently  under  develop¬ 
ment  and  validation.  A  report  on  monitoring  and  assessment  of  marine  benthos  at  UK 
dredged  material  disposal  sites  has  recently  been  published  by  the  GCSDM  (SOAEFD 
1996). 

20.5  Conclusions 

There  are  many  areas  of  overlap  between  current  procedures  in  Europe  and  those  recom¬ 
mended  in  previous  chapters.  Some  areas  of  common  research  include  these: 

1)  Development  of  triggers  for  product  assessment  based  on  adsorption,  persis¬ 
tence,  and  toxicity 

2)  Development  of  a  risk  assessment  framework  for  prospective  testing 
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3)  Standardization  of  existing  whole-sediment  toxicity  test  methods 

4)  Refinement  of  exposure  models 

5)  Development  of  artificial  sediments  for  product  testing 

6)  Development  of  integrated  guidelines  for  site  cleanup  and  dredged  material 
disposal  based  on  sediment  chemistry,  toxicity,  and  biology 

7)  Development  of  chronic  and  in  situ  bioassays  for  all  SERA  applications 
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The  following  definitions  obtained  from  USEPA  (1994)  and  ASTM  (1995a-1995d)  are 
applicable  to  many  of  the  terms  listed  below. 

accuracy  Combination  of  bias  and  precision  for  a  procedure  which  reflects  the 
closeness  of  a  measured  value  to  a  true  value. 

assessment  endpoint  An  explicit  expression  of  the  environmental  value  to  be  protected, 

bias  Consistent  deviation  of  measured  values  from  the  true  value  caused  by 
systematic  errors  in  a  procedure  (difference  between  the  true  values 
from  the  mean  value  determined  by  using  a  large  number  of  replicate 
determinations). 

bioaccumulation  The  net  accumulation  of  a  substance  by  an  organism  as  a  result  of 
uptake  from  all  environmental  sources. 

bioaccumulation  factor  Ratio  of  tissue  residue  to  source  compartment  ,  sediment) 

contaminant  concentration  at  steady-state. 

bioconcentration  The  net  assimilation  of  a  substance  by  an  aquatic  organism  as  a  result 

of  uptake  directly  from  aqueous  solution, 

bioconcentration  factor  Ratio  of  tissue  residue  to  water  contaminant  concentration  at  steady  - 

(BCF)  state. 

biota-to-sediment  The  ratio  of  lipid-normalized  tissue  reside  to  organic-carbon- 

accumulation  factor  normalized  sediment  contaminant  concentration  at  steady  state, 

(BSAF)  with  units  of  g-carbon/g-lipid. 

characterization  of  Portion  of  the  analysis  phase  of  ecological  risk  assessment  that 

ecological  effects  evaluates  the  ability  of  a  stressor  to  cause  adverse  effects  under  a 

particular  set  of  circumstances. 

community  An  assemblage  of  populations  of  different  species  within  a  specified 

location  in  space  and  time. 

conceptual  model  A  model  which  describes  a  series  of  working  hypotheses  as  to  how  the 

stressor  might  affect  ecological  components.  The  model  also 
describes  the  ecosystem  potentially  at  risk,  the  relationship  between 
measurement  and  assessment  endpoints,  and  exposure  scenarios, 

contaminated  sediment  Sediment  containing  chemical  substances  at  concentrations  that  pose 
a  known  or  suspected  threat  to  environmental  or  human  health. 

control  sediment  A  sediment  which  is  essentially  free  of  contaminants  and  is  used 

routinely  to  assess  the  acceptability  of  a  test. 

cumulative  effects  The  combined  effects  of  multiple  stressors  ( e.g.,  metals  and  pesticides) 

or  multiple  events  {e.g. ,  placement  of  dredged  material  from  several 
different  projects  at  a  site  over  time). 

depuration  Loss  of  a  substance  from  an  organism  as  a  result  of  any  active  ( e.g. , 

metabolic  breakdown)  or  passive  process  when  the  organism  is 
placed  into  an  uncontaminated  environment. 

direct  effect  The  stressor  acts  on  the  ecological  component  of  interest  itself,  rather 
than  through  effects  on  other  components  of  the  ecosystem. 

ecological  component  Any  part  of  an  ecological  system,  including  individuals,  populations, 

communities,  and  the  ecosystem  itself. 
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ecological  risk  assessment 
(ERA) 

ecosystem 

elimination 

exposure 

exposure  profile 

exposure  scenario 

indirect  effect 
measurement  endpoint 

median  lethal 
concentration  (LC50) 

no  observed  effect 
concentration  (NOEC) 

population 

pore  water 
precision 

recovery 
reference  sediment 
reference  toxicity  tests 


The  process  that  evaluates  the  likelihood  that  adverse  ecological 
effects  may  occur  or  are  occurring  as  a  result  of  exposure  to  one  or 
more  stressors. 

The  biotic  community  and  abiotic  environment  within  a  specified 
location  in  space  and  time. 

General  term  for  the  loss  of  a  substance  from  an  organism  which 
occurs  by  any  active  or  passive  means. 

Co-occurrence  of  or  contact  between  a  stressor  and  an  ecological 
component. 

The  product  of  a  characterization  of  exposure  in  the  analysis  phase  of 
ecological  risk  assessment.  The  exposure  scenario  summarizes  the 
magnitude  and  spatial  and  temporal  patterns  of  exposure  for  the 
scenarios  described  in  the  conceptual  model. 

Set  of  assumptions  concerning  how  an  exposure  may  take  place, 
including  assumptions  about  the  exposure  setting,  stressor 
characteristics,  and  activities  that  may  lead  to  exposure. 

The  stressor  acts  on  supporting  components  of  the  ecosystem,  and 
these  in  turn  have  an  effect  on  the  ecological  component  of  interest. 

A  measurable  characteristic  which  is  related  to  the  valued 
characteristic  chosen  as  the  assessment  endpoint.  Measurement 
endpoints  are  often  expressed  as  the  statistical  or  arithmetic 
summaries  of  the  observations  which  comprise  the  measurement. 

A  statistically  or  graphically  estimated  concentration  which  is 
expected  to  be  lethal  to  50  percent  of  a  group  of  organisms  under 
specified  conditions. 

The  highest  level  of  a  stressor  evaluated  in  a  test  which  does  not  cause 
statistically  significant  differences  from  the  controls.  Same  definition 
applies  to  the  no  observed  effect  level  (NOEL). 

An  aggregate  of  individuals  of  a  species  within  a  specified  location  in 
time  and  space. 

Water  occupying  space  between  sediment  or  soil  particles. 

Measure  of  agreement  among  replicable  analyses  of  a  sample. 
Precision  is  measured  by  repeatability  and  reproducibility. 

Partial  or  full  return  of  a  population  or  community  to  a  condition 
that  existed  before  the  introduction  of  the  stressor. 

A  whole  sediment  near  an  area  of  concern  used  to  assess  sediment 
conditions  exclusive  of  material(s)  of  interest. 

A  test  with  a  high-grade  reference  material  conducted  in  conjunction 
with  sediment  tests  to  determine  possible  changes  in  condition  of  the 
test  organisms. 
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relative  bias 


repeatability 


reproducibility 


risk  characterization 


sediment 
spiked  sediment 

steady  state 

stressor 

stressor-response  profile 

trophic  levels 
true  value 


whole  sediment 


xenobiotic 


difference  between  the  mean  value  as  determined  by  a  method  using  a 
large  number  of  replicate  determinations  and  the  true  value  of  the  test 
sample. 

Measure  of  the  degree  of  agreement  among  replicate  analyses  carried 
out  simultaneously  or  in  rapid  succession  by  the  same  operator  using 
the  same  apparatus  under  the  same  conditions  for  the  analysis  of  the 
same  sample.  " 

Measure  of  the  degree  of  agreement  among  replicate  analyses  carried 
out  by  operators  in  different  laboratories  using  different  apparatus 
under  different  conditions  for  the  analysis  of  the  same  sample. 

A  phase  of  ecological  risk  assessment  which  incorporates  the  results 
of  the  exposure  and  ecological  effects  analyses  to  evaluate  the 
likelihood  of  adverse  ecological  effects  associated  with  exposure  to  a 
stressor.  The  ecological  significance  of  the  adverse  effects  is  discussed, 
including  consideration  of  the  types  and  magnitudes  of  the  effects, 
their  spatial  and  temporal  patterns,  and  the  likelihood  of  recovery. 

Particulate  material  which  usually  lies  below  water. 

A  sediment  to  which  a  material  has  been  added  for  experimental 
purposes. 

A  "constant"  tissue  residue  resulting  from  the  balance  of  the  flux  of  a 
compound  into  and  out  of  the  organism.  Operationally  defined  by 
no  statistically  significant  difference  in  three  consecutive  sampling 
periods. 

Any  physical,  chemical,  or  biological  entity  which  can  induce  an 
adverse  response. 

The  product  of  characterization  of  ecological  effects  in  the  analysis 
phase  of  ecological  risk  assessment.  The  stressor-response  profile 
summarizes  the  data  on  the  effects  of  a  stressor  and  the  relationship  of 
the  data  to  the  assessment  endpoint. 

A  functional  classification  of  taxa  within  a  community  which  is  based 
on  feeding  relationships. 

The  known  value  (i.e.,  the  actual  quantitative  valued  implied  by  the 
preparation  of  the  sample).  Since  the  known  value  does  not  always 
exist,  it  is  often  considered  the  value  towards  which  the  average  of 
single  results  obtained  by  n  laboratories  as  n  approaches  infinity. 
Consequently,  such  a  true  value  is  associated  with  a  particular 
method. 

Sediment  and  associated  pore  water  which  have  had  minimal 
manipulation.  Synonymous  with  bulk  sediment. 

A  chemical  or  other  stressor  which  does  not  occur  naturally  in  the 
environment. 
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ecological  relevance  116, 118, 143, 149, 
186,  188 

international  perspectives  373-374 
methodological  uncertainty  201-202, 
271,283-289,  292,  298,  310-312, 
314,  317,  325,  340 
product  safety  12, 15,  32 
sediment  94 
synoptic  94,  326,  339 
screening  12-15,  19,  27-28, 30-31, 

33,  36,66,  75,  78,83-84,94-95, 

102,  104,  106,  169,204,265,354, 
356,  359 

screening-level  concentration  (SLC) 

333- 338 

seasonal  scales.  See  variability 
seasonal  variability.  See  variability 
sediment  chemistry  48,  64, 297-298,  301, 
319,  327-329,  331-333,  338,375 
sediment  criteria  78 

sediment  phases  299-300,  302,  310,  327 
sediment  quality  45,  49,  62,  370-371, 

373 

sediment  quality  criteria  (SQC)  18,  76, 

103,  no,  127,  191,  264,  332, 

334- 337,  354,362,  371 
sediment  quality  guidelines  (SQGs) 

103-104,  191,  257,  264-265,  280, 
297-298,  328,  332-338,340 
sediment  quality  standards  370,  372 
sediment  quality  triad  5, 101, 110, 125,  149, 
298,  307,  319,  338,  340,  357 
selenium  214,  219-220,  222 
SEM.  See  simultaneously  extracted  metals 
sensitivity  302,  307,  315-316,  324-325, 
329-330,  335-336 

simultaneously  extracted  metals  (SEM)  31, 
215,258-260,  328,  330,  333-337 
sorption  239,  246-247,  250 
sorptivity  13-14,  18 


source  definition  85 
spatial  scales.  See  variability 
speciation.  metals 
spiked  sediment  180,  214,  220,  261-262, 
371 

SQC.  See  sediment  quality  criteria 
SQG.  See  sediment  quality  guidelines 
stakeholders  18,  42,  65,  69,  78-79,  89, 167, 
357 

standardization 

contaminated  sites  109 
ecological  relevance  116 
international  perspectives  354-355, 

359,  362,371,  375 
methodological  uncertainty  199,  287, 

297,  300,  303,  308,  317,  325,  331,  336 
product  safety  11,  34,  39 
statistics  140-141, 144-145, 149-151 
steady  state  35,  46,  219,  225-227,  240, 
244,  246,  324,  340 
stressors  361 

contaminated  sites  74,  87,  91,  106 
dredging  50-57,  59-62,  64,  66,  68 
ecological  relevance  123,  135,  141, 

150,  157-158,  162,  167,  169, 
173-174,  177,  179-182,  184-185, 
189-191 

methodological  uncertainty  273-279, 

292,  301,  303,  307,  310,  313-314, 
328,  335,  340 
structural  changes  179, 276 
structural  endpoints  124 
structural  limitations  287 
Superfund  74-76,  79,  112 
suspended  solids  299-300,  302-304 

T 

TEL.  See  threshold  effect  level 
temporal  scales.  variability 
thermodynamic  equilibrium  213,  218-219. 

See  also  equilibrium  partitioning 
threshold  effect  level  (TEL)  333 
TIE.  See  toxicity  identification  evaluation 
tiered  approach  75,  79,  83-84,  100,  113 
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tissue  concentrations  89,  94,  108 
tissue-residue-based  toxicity  211,  228-230 
TOC.  See  total  organic  carbon 
total  organic  carbon  (TOC)  213-215 
total  organic  carbon  (TOC)  normalization. 

See  equilibrium  partitioning 
Toxic  Substances  Control  Act  (TSCA)  16 
toxic  unit  (TU)  concentration  257-259,  261 
toxicity  identification  evaluation  (TIE) 
contaminated  sites  109 
ecological  relevance  119,  121,  168, 184 
methodological  uncertainty  259-261, 
266, 274, 285,  301,  304,  309-310, 
338-340 

toxicity  tests.  See  endpoints 
transport  239-240,  242-244,  246-250 
trophic  levels  53,  60, 88, 115, 123-124, 

126, 129-130, 169, 178-179, 218, 230 
TSCA.  See  Toxic  Substances  Control  Act 

u 

uncertainty 

analysis  199-204,  209,  271,  358 
contaminated  sites  79,  83,  88,  95, 

98,  100-102,  104-106 
dredging  55,  59,  66-68 
false  negative  error  104,  271,  280-283, 
288,  292,357 

false  positive  error  271,  280-283,  287, 
292 

prediction  error  274 
product  safety  19,26,29,  31,  33-34, 
36-38 

urban  runoff  360-361 

V 

variability 

seasonal  216, 218, 325 
spatial  56, 67-68, 70, 91-92, 136-137, 
140, 142, 144, 146, 149-151, 168, 
175-176, 179-180, 183-185, 191, 278, 
325 

temporal  56,  67,  70, 91-92, 136-137, 

140, 142, 144-145, 149-151, 168, 175, 
179-180, 183-185, 191 


w 

weight  of  evidence  36, 79-80,  100-101, 
103, 106-107,  113, 168, 181, 185, 

260,  338,340 
whole  sediment  95 

product  safety  15,  24,  31,  34,  36 
dredging  42,  45^6,  49,  62,  67,  69 
ecological  relevance  117,  149,  183, 188 
methodological  uncertainty  261, 266, 
299-301,  309-310,  327-331 
whole  sediment  toxicity  tests.  See  endpoints 
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A  Professional  Society  for  Environmental  Scientists  and  Engineers  and  Related 
Disciplines  Concerned  with  Environmental  Quality 

The  Society  of  Environmental  Toxicology  and  Chemistry  (SETAC),  with  offices  in  North 
America  and  Europe,  is  a  nonprofit,  professional  society  that  provides  a  forum  for  indi¬ 
viduals  and  institutions  engaged  in  the  study  of  environmental  problems,  management 
and  regulation  of  natural  resources,  education,  research  and  development,  and  manufac¬ 
turing  and  distribution. 

Goals 

•  Promote  research,  education,  and  training  in  the  environmental  sciences 

•  Promote  systematic  application  of  all  relevant  scientific  disciplines  to  the 
evaluation  of  chemical  hazards 

•  Participate  in  scientific  interpretation  of  issues  concerned  with  hazard  assess¬ 
ment  and  risk  analysis 

•  Support  development  of  ecologically  acceptable  practices  and  principles 

•  Provide  a  forum  for  communication  among  professionals  in  government, 
business,  academia,  and  other  segments  of  society  involved  in  the  use,  protec¬ 
tion,  and  management  of  our  environment 
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•  Annual  meetings  with  study  and  workshop  sessions,  platform  and  poster 
papers,  and  achievement  and  merit  awards 

•  Monthly  scientific  journal,  Environmental  Toxicology  and  Chemistry,  SETAC 
newsletter,  and  special  technical  publications 

•  Funds  for  education  and  training  through  the  SETAC  Scholarship/Fellowship 
Program 

•  Chapter  forums  for  the  presentation  of  scientific  data  and  for  the  interchange 
and  study  of  information  about  local  concerns 

•  Advice  and  counsel  to  technical  and  nontechnical  persons  through  a  number  of 
standing  and  ad  hoc  committees 

Membership 

SETAC  s  growing  membership  includes  more  than  5,000  individuals  from  government, 
academia,  business,  and  public-interest  groups  with  technical  backgrounds  in  chemistry, 
toxicology,  biology,  ecology,  atmospheric  sciences,  health  sciences,  earth  sciences,  and 
engineering. 

If  you  have  training  in  these  or  related  disciplines  and  are  engaged  in  the  study,  use,  or 
management  of  environmental  resources,  SETAC  can  fulfill  your  professional  affiliation 
needs.  Membership  categories  include  Associate,  Student,  Senior  Active,  and  Emeritus. 

For  more  information,  contact  SETAC,  1010  North  12th  Avenue,  Pensacola,  Florida; 
T  904  469  1500;  F  904  469  9778;  E  setac@setac.org;  http://www.setac.org. 


